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VARIANT TLR4 NUCLEIC ACID AND USES THEREOF 

S Statement of Government Rig hts 

This invention was made at least in part with a grant from the 
Government of the United States of America (grants ES06537, ES07498 and 
ES05605 from the National Institute of Environmental Sciences, grant HL62628 
from the National Heart and Lung Institute, and grant RR00059 from the 
10 General Clinical Research Centers Program). The Government may have certain 
rights in the invention. 

Background of the Invention 

Endotoxin or lipopolysaccharide (LPS), released firom the cell wall of 

15 bacteria, plays a central role in a broad spectrum of human disease. The 
pathogenic importance of LPS in gram-negative sepsis is well established. 
Intravenous LPS induces all of the clinical features of gram-negative sepsis, 
including fever, shock, leukopenia followed by leukocytosis, and disseminated 
intravascular coagulation (Favorite et al., 1942). Higher concentrations of 

20 circulating levels of endotoxin have been associated with manifestations of 
systemic inflammatory response sjmdrome (Wang et al., 1995) and the 
development of acute respiratory distress syndrome following sepsis (Brigham et 
al., 1986). Inhaled endotoxin can induce airflow obstruction in naive or 
previously unexposed subjects (Michel et al., 1992) and is the most important 

25 occupational exposure associated with the development Schwartz et al., 1995a), 
and progression (Schwartz et al., 1995b), of airway disease among exposed 
workers. The concentration of endotoxin in the domestic setting appears to be 
associated with the clinical severity of asthma (Michel et al., 1996). Moreover, 
recent studies have shown that endotoxin is a contaminant of particulate matter 

30 in air pollution and may play a role in the pathophysiologic consequences of air 
pollution (Bonner et al., 1998). Thus, endotoxin is an important cause of 
morbidity and mortality. 

The ability of the host to respond to endotoxin may play an important 
role in determining the severity of the physiologic and biologic response to this 

35 frequently encountered toxin. In mice, genetic differences in susceptibihty to 
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frequently encountered toxin. In mice, genetic differences in susceptibility to 
LPS have been established. LPS hyporesponsiveness arose spontaneously and 
was first identified in the C3H/HeJ strain. This strain had an LD50 for LPS at 
least 20 times that observed in A/HeJ mice (Sultzer et al, 1968). In addition to 
5 C3H/HeJ, two other mouse strains, C57BL/10ScCR (Coutinho et al., 1978) and 
its progenitor strain, C57BL/10ScN, (Vogel et al.,1979), are hyporesponsive to 
LPS. 

Moreover, several reports suggest that humans may also respond 
differently to LPS. A patient with recurrent bacterial infections has been 
1 0 reported to be refractory to the in vivo and in vitro effects of LPS (Kuhns et al., 

1997) . Following challenge with intravenous LPS, this patient had no systemic 
increase in IL-6 or G-CSF and had a minimal rise in the concentration of 
leukocytes, TNF-a, and IL-8. Inter-individual differences have also been 
reported in the release and synthesis of cytokines by monocytes stimulated with 

1 5 LPS in vitro (Santamaria et al, 1 989). 

LPS is thought to cause much of its morbidity and mortality by activating 
kinases (DeFranco et al., 1998) that control the function of transcription factors 
(nuclear factor-KB and AP-1) and ultimately lead to production of pro- 
inflammatory cytokines and co-stimulatory molecules (Wright, 1999). Several 

20 lines of evidence suggest that the toll receptor (TLR) family, and specifically 
TLR4 and TLR2 regulate the interaction between LPS and intracellular kinases 
and may serve as a proximal target to interrupt LPS signaling (Wright, 1998; 
Medzhitov et al, 1997). Both TLR4 and TLR2 activate signaling through NF- 
kB and AP-1 in transfected hiiman cell lines (Medzhitov et al., 1997; Yang et al., 

25 1998), and TLR4 mediates LPS induced signal transduction (Chow et al., 1999). 
CD 14, a glycosylphosphatidyl inositol-linked receptor that binds LPS (Poltorak 
et al., 1998a) enhances LPS induced TLR2 (Yanget al., 1998) and TLR4 (Chow 
et al., 1 998) signaling, suggesting that the toll receptors interact with CD 14 to 
initiate the cellular response to LPS. Studies in mice indicate that 1) the TLR4 

30 gene maps to the critical region in LPS hyporesponsive mice (PoUorak et al., 

1998) , 2) mutations in the TLR4 gene (Poltorak et al., 1998; Qureshi et al., 1999) 
are found in mouse strains (C3H/HeJ and C57BL10/ScCr) that are defective in 
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their response to LPS, and 3) disruption of the TLR4 gene results in a LPS 
hyporesponsive phenotype (Hoshino et aL, 1999). 

Thus, there is need to determine whether the human TLR4 gene is 
polymorphic, and whether any particular polymorphism is associated with 
5 disease, e.g., LPS hyporesponsiveness. 

Summary of the Invention 

The invention provides a method to identify a mammal, e.g., a human, at 
risk of, or having, an indication associated with altered innate immunity, e.g., to 

10 bacterial infection. The method comprises contacting an amount of DNA 
obtained from a human physiological sample with an amount of at least one 
TLR4-specific oligonucleotide under conditions effective to amplify the DNA so 
as to yield amplified DNA. Then it is determined whether the amplified DNA 
comprises a nucleotide substitution, e.g., one that results in an amino acid 

1 5 substitution, i.e., the TLR4 DNA of the human encodes a variant TLR4. Thus, 
the invention is useful to detect polymorphisms in the TLR4 gene. 

Normal healthy, non-asthmatic subjects demonstrate a reproducible 
airway response to an incremental LPS inhalation challenge test, with some 
subjects developing airflow obstruction when challenged with low 

20 concentrations of LPS and others virtually unaffected by high concentrations of 
inhaled LPS. These findings suggest that the spectrum of LPS responsiveness in 
humans is quite variable from one individual to the next (but reproducible within 
an individual), and that a substantial portion of the population may be 
hyporesponsive to inhaled LPS. As described hereinbelow, an incremental LPS 

25 inhalation challenge test was employed to reliably phenotype individuals as 

either responsive (at least a 20 % decline in the forced expiratory volume in one 
second (FEV,) after inhaling up to 41.5 ng LPS) or hyporesponsive (FEV, > 80 
% of their baseline after inhahng 41 .5 ^ig of LPS) to inhaled LPS. Fifty-two (63 
%) of these individuals were responsive to inhaled LPS and 31 (37 %) were 

30 hyporesponsive to inhaled LPS. 

These results were employed to determine the relationship between 
polymorphisms in the TLR4 gene and the airway response to inhaled LPS in the 
83 normal healthy, non-asthmatic subjects. Using single stranded 

3 
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conformational variant (SSCV) analysis and direct sequencing, a missense 
. mutation (A896G) was identified in the fourth exon of the TLR4 gene that 
results in replacement of a conserved aspartic acid residue with glycine at 
position 299 in the extracellular domain of the TLR4 receptor. The Asp299Gly 
5 sequence variant occurred in 3 LPS responsive (5.8 %) and 7 LPS 

hyporesponsive (22.6 %) study subjects (p=0.03). Among the subjects with the 
common TLR4 allele (N=73), the dose-response slope (percent decline 
FEV /cumulative dose of inhaled LPS) averaged a L86 % decline in FEV,/ng 
inhaled LPS (range 0.01 %- 19.78 %), while the dose-response slope for the 

1 0 subjects with the Asp299Gly allele (N=l 0) was significantly less (p=0.007), 

averaging 0.59 % decline in FEV,/^g inhaled LPS (range 0.00 %-1.59 %). Thus, 
a sequence polymorphism in the TLR4 gene, i.e., a missense mutation 
(Asp299Gly) in the fourth exon of the TLR4 gene, occurs in a substantial portion 
of the population, and is associated with an airway hyporesponsive in humans 

1 5 challenged with inhaled LPS. The allelic frequency of the A896G substitution 
was 6.6 % in the study population, 7.9 % in a normal control population from 
Iowa (Lidral et al., 1998), and 3.3 % in the parental chromosomes of the CEPH 
population (NIH-CEPH, 1992). As also described herein, the presence of a 
TLR4 mutation was associated with gram negative sepsis, severity of sepsis, pre- 

20 term delivery, and respiratory distress syndrome in pre-term infants. 

The invention also provides an isolated and purified nucleic acid 
molecule comprising a nucleic acid segment, e.g., genomic DNA or cDNA, 
encoding TLR4, such as a variant TLR4. Also provided are primers, 
oligonucleotides and probes comprising the isolated nucleic acid sequences of 

25 the invention. The nucleic acid molecules of the invention may be single 
stranded or double stranded. 

Transfection of CHO cells with either the wild-type or the mutant 
(Asp299Gly) allele of the TLR4 gene demonstrated that this mutation interrupts 
TLR4-mediated LPS signaling. Moreover, the wild-type allele of TLR4 rescues 

30 the LPS hyporesponsive phenotype in either airway epithelial cells or alveolar 
macrophages obtained from individuals with the TLR4 mutation. Thus, these 
results provide the first genetic evidence that a common mutation causes 
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differences in LPS responsiveness that may contribute to several disease states in 
humans. 

Therefore, the invention further provides an expression cassette 
comprising a nucleic acid molecule of the invention, a host cell transformed with 
5 the expression cassette, and TLR4 polypeptides isolated therefrom. The 
transformed host cells, or isolated TLR4 polypeptides, may be useful in 
identifying agents that modulate, i.e., enhance or inhibit, TLR4 activity. For 
example, an expression cassette comprising a nucleic acid molecule of the 
invention which encodes a variant TLR4 polypeptide is introduced to murine 

10 cells, e.g., oocytes via microinjection (see Sigmund et al., 1993). The resulting 
pups are screened for the presence of the nucleic acid molecule. Hence, the 
invention also provides a transgenic mouse, the genome of the cells of which is 
augmented with variant human TLR4 DNA. Human TLR4 transgenic mice of 
the invention have altered innate inununity, e.g., they are more susceptible to 

1 5 gram negative sepsis than their nontransgenic counterparts. 

The invention also provides a method to treat an individual at risk of, or 
having, an indication associated with altered innate immunity, in which an agent 
that alters TLR4 activity is administered to the individual. 

20 Brief Description of the Figures 

Figure 1 . Genomic structure for the human TLR4 gene (exons are 
represented by boxes and introns are represented by lines). The coding sequence 
is in black and the nucleotides encoding the transmembrane domain (TM) are 
shaded. The positions of the introns in the published TLR4 cDNA sequence 

25 (Rock et al., 1998; Genbank Accession No. U88880) are listed. The exon (caps) 
and intron (lower case) sequences at each of the splice junctions is shown 
(sequences at 5* and 3' splice junctions for intron 1, SEQ ID NO: 1 and SEQ ID 
NO:63, respectively; for intron 2, SEQ ID N0:2 and SEQ ID NO:64^, 
respectively; and for intron 3, SEQ ID N0:3 and SEQ ID NO:65, respectively). 

30 Figure 2. The human TLR4 gene is alternatively spliced. Photographs 

are of agarose gels containing the products amplified from a multi-tissue cDNA 
panel from human adult (panel a; Clontech #K 1420-1, Palo Alto, CA), and 
human fetal (panel b; Clontech #K1425-1, Palo Alto, CA) cDNA. The tissues 

5 
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included brain (B), heart (H)» kidney (K), liver (Li), lung (Lu), pancreas (Pa), 
placenta (PI), skeletal muscle (Sk), spleen (Sp), and thymus (T). The first lane 
of each gel included a 100 bp molecular weight standard (MW). The forward 
and reverse PCR primers were derived from exons 1 and 4, respectively (see 

5 panel c) and amplified three products of 453, 333, and 167 bp. The DNA 

sequence for these bands showed that both exons 2 and 3 were present in the 453 
bp band, that exon 2 was absent in the 333 bp product, and that both exons 2 and 
3 were absent in the 167 bp band. The 453 bp and 333 bp sequences are 
identical to previously published sequences for the human TLR cDNA 

10 (Medzhitov et al., 1997; Rock et aL, 1998). The open, closed, and shaded boxes 
indicate the untranslated, translated, and transmembrane domain portions of the 
TLR4 exons, respectively. The ends of the cDNAs were arbitrarily terminated at 
the stop codon (STP). 

Figure 3. Mutations in the human TLR4 gene. The SSCV and sequence 

15 analysis were performed blinded to the LPS response phenotype of the study 
subjects. The SSCV gel in panel a contains the products amplified from two 
samples that are homozygous for the 1060A allele (lanes 1 and 3), a 
heterozygous sample (lane 2) with both the 896A and 896G alleles, and a 
homozygous sample (lane 4) with only the 896G allele. The SSCV gel in panel 

20 b contains the products amplified from a sample that is homozygous for the T 
allele at position -1 1 and the T allele at position 315 (lane 1), and from a sample 
that is heterozygous with a T and a deletion at position -1 1 and a T and a C allele 
at position 315. The nucleotide numbers are based on the previously published 
TLR4 cDNA sequence (Rock et al., 1998). 

25 Figure 4. The aspartic acid residue at position 299 is conserved. A 

portion of the predicted amino acid sequence is aligned for the TLR4 genes from 
human (Rock et al, 1998; SEQ ID N0:4), mouse (Poitorak et al, 1998; SEQ ID 
N0:5), rat (Genbank Accession No. AF057025; SEQ ID N0:6), and hamster (D. 
Golenbock, SEQ ID N0:7). The position of the first amino acid in each 

30 sequence is given. The aspartic acid at position 299 is indicated with an arrow. 
Conserved amino acids are shaded. 

Figure 5. Frequency distribution histogram of the dose-response slope 
(% decline in FEV,/^g LPS). The % decline in FEV/^ig LPS was calculated 

6 



wo 00/77204 



PCTAJSOO/15723 



following administration of the cumulative LPS dose that either resulted in at 
least a 20 % decline in FEV, or the decline in FEVj following a cumulative 
inhaled dose of 41.5 ^g LPS. Subjects above the x-axis (solid bars) are 
homozygous for the 896A allele (WT/WT); subjects below the x-axis (open bars) 
5 are either heterozygous or homozygous (*) for the missense Asp299Gly allele. 
The data is replotted in the inset after the values on the x-axis were log 
normalized. P values are presented for the comparison of the % decline in 
FEV,/|xg LPS between subjects with the WT/WT genotype (N=73) and those 
with at least one Asp299Gly allele (N=10) using absolute values (P=0.037) and 

10 log normalized values (P=0.026). Since the distribution of the dose-response 
slope (% decline FEV, /cumulative dose of inhaled LPS) was highly skewed, the 
two-sample Monte-Carlo permutation test based on 10,000 permutations was 
used to calculate P values (Fisher et al., 1993) the missense mutation (N=10) 
using absolute values (p=0.04) and the log normalized values (p=0.04) and the 

1 5 log normalized values (p=0.025). 

Figure 6. A) CHO/CD-14 cells that express the CD- 14 receptor (generous 
gift of Doug Golenbock) were transfected with a wild-type or mutant TLR4 
allele, and then exposed to LPS. Cells were co-transfected with a luciferase 
reporter construct. B) IL-la levels from airway epithelial cells of individuals 

20 which had been genotyped for TLR4. II- la levels before and after LPS exposure 
are shown. C) IL-la levels from airway epithelial cells from a TLR4 
heterozygote (WT/Asp299Gly). The cells were transduced with a recombinant 
adenovirus which expresses TLR4 (Rock et al., 1998), or a recombinant 
adenovirus which expresses green fluorescent protein (GFP), and then exposed 

25 to LPS. D) TNF-a levels from alveolar macrophage from a TLR4 homozygote 
(Asp299Gly/Asp299Gly). The cells were transduced with a recombinant 
adenovirus which expresses TLR4 (Rock et al, 1998), and then exposed to LPS. 
Figure 7. Codons. 

Figure 8. Exemplary amino acid substitutions. 
30 Figure 9. A) A partial nucleotide sequence of genomic human TLR4 

DNA (SEQ ID NO:62). B) The 5' UTR, exon 1 (nt 1 100-1283), and a partial 
sequence of intron 1 (Genbank Accession No. AF 172 169; SEQ ID NO:70). C) 
Exon 2 (nt 192-31 1), exon 3 (nt 556-722), and intron 2 (244 bp) (Genbank 

7 
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Accession No. AF172170;SEQ ID N0:71). D) Exon 4 (nt 1691-6172) and the 
3* UTR (Genbank Accession No. AF 1721 71; SEQ ID NO:72). 

Detailed Description of the Invention 

5 As used herein, the terms "isolated and/or purified" refer to in vitro 

preparation, isolation and/or purification of a nucleic acid molecule or 
polypeptide, so that it is not associated with in vivo substances. Thus, with 
respect to an "isolated nucleic acid molecule", which includes a polynucleotide 
of genomic, cDNA, or synthetic origin or some combination thereof, the 

10 "isolated nucleic acid molecule" (1) is not associated with all or a portion of a 
polynucleotide in which the "isolated nucleic acid molecule" is found in nature, 

(2) is operably linked to a polynucleotide which it is not linked to in nature, or 

(3) does not occur in nature as part of a larger sequence. An isolated nucleic acid 
molecule means a polymeric form of nucleotides of at least 10 bases in length, 

1 5 either ribonucleotides or deoxynucleotides or a modified form of either type of 
nucleotide. The term includes single and double stranded forms of DNA. The 
term "oligonucleotide" referred to herein includes naturally occurring, and 
modified nucleotides linked together by naturally occurring, and non-naturally 
occurring oligonucleotide linkages. Oligonucleotides are a polynucleotide 

20 subset with 200 bases or fewer in length. Preferably, oligonucleotides are 10 to 
60 bases in length and most preferably 12, 13, 14, 15, 16, 17, 18, 19, or 20 to 40 
bases in length. Oligonucleotides are usually single stranded, e.g., for primers or 
probes; although oligonucleotides may be double stranded, e.g., for use in the 
construction of a variant. Oligonucleotides of the invention can be either sense 

25 or antisense oligonucleotides. The term "naturally occurring nucleotides" 

referred to herein includes deoxyribonucleotides and ribonucleotides. The term 
"modified nucleotides" referred to herein includes nucleotides with modified or 
substituted sugar groups and the like. The term "oligonucleotide linkages" 
referred to herein includes oligonucleotides linkages such as phosphorothioate, 

30 phosphorodithioate, phophoroselenoate, phosphorodiselenoate, 

phosphoroanilothioate, phosphoraniladate, phosphoroamidate, and the like. An 
oligonucleotide can include a label for detection, if desired. 
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The term "isolated polypeptide" means a polypeptide encoded by 
genomic DNA, cDNA or recombinant RNA, or is synthetic origin, or some 
combination thereof, which isolated polypeptide (1) is not associated with 
proteins found in nature, (2) is free of other proteins from the same source, e.g., 
5 free of human proteins, (3) is expressed by a cell from a different species, or (4) 
does not occur in nature. 

The term "sequence homology" means the proportion of base matches 
between two nucleic acid sequences or the proportion amino acid matches 
between two amino acid sequences. When sequence homology is expressed as a 

10 percentage, e.g., 50 %, the percentage denotes the proportion of matches over the 
length of sequence from one TLR4 allele that is compared to another TLR4 
allele. Gaps (in either of the two sequences) are permitted to maximize 
matching; gap lengths of 15 bases or less are usually used, 6 bases or less are 
preferred with 2 bases or less more preferred. When using oligonucleotides as 

15 probes or treatments, the sequence homology between the target nucleic acid and 
the oligonucleotide sequence is generally not less than 17 target base matches 
out of 20 possible oligonucleotide base pair matches (85 %); preferably not less 
than 9 matches out of 10 possible base pair matches (90 %), and more preferably 
not less than 19 matches out of 20 possible base pair matches (95 %). 

20 The term "selectively hybridize" means to detectably and specifically 

bind. Polynucleotides, oligonucleotides and fragments of the invention 
selectively hybridize to nucleic acid strands under hybridization and wash 
conditions that minimize appreciable amounts, of detectable binding to 
nonspecific nucleic acids. High stringency conditions can be used to achieve 

25 selective hybridization conditions as known in the art and discussed herein. 
Generally, the nucleic acid sequence homology between the polynucleotides, 
oligonucleotides, and fragments of the invention and a nucleic acid sequence of 
interest is at least 65 %, and more typically with preferably increasing 
homologies of at least about 70 %, about 90 %, about 95 %, about 98 %, and 100 

30 %. 

Two amino acid sequences are homologous if there is a partial or 
complete identity between their sequences. For example, 85 % homology means 
that 85 % of the amino acids are identical when the two sequences are aligned 

9 
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for maximum matching. Gaps (in either of the two sequences being matched) 
are allowed in maximizing matching; gap lengths of 5 or less are preferred with 
2 or less being more preferred. Alternatively and preferably, two protein 
sequences (or polypeptide sequences derived from them of at least 30 amino 
5 acids in length) are homologous, as this term is used herein, if they have an 

alignment score of at more than 5 (in standard deviation units) using the program 
ALIGN with the mutation data matrix and a gap penalty of 6 or greater. See 
Dayhoff, M. O., in Atlas of Protein Sequence and Structure, 1972, volume 5, 
National Biomedical Research Foundation, pp. 101-110, and Supplement 2 to 

10 this volume, pp. 1-10. The two sequences or parts thereof are more preferably 
homologous if their amino acids are greater than or equal to 50 % identical when 
optimally aligned using the ALIGN program. 

The term "corresponds to" is used herein to mean that a polynucleotide 
sequence is homologous (i.e., is identical, not strictly evolutionarily related) to 

15 all or a portion of a reference polynucleotide sequence, or that a polypeptide 
sequence is identical to a reference polypeptide sequence. In contradistinction, 
the term "complementary to" is used herein to mean that the complementary . 
sequence is homologous to all or a portion of a reference polynucleotide 
sequence. For illustration, the nucleotide sequence "TATAC" corresponds to a 

20 reference sequence "TAT AC" and is complementary to a reference sequence 
"GTATA". 

The following terms are used to describe the sequence relationships 
between two or more polynucleotides: "reference sequence", "comparison 
window", "sequence identity", "percentage of sequence identity", and 

25 "substantial identity". A "reference sequence" is a defined sequence used as a 
basis for a sequence comparison; a reference sequence may be a subset of a 
larger sequence, for example, as a segment of a full-length cDNA or gene 
sequence given in a sequence listing, or may comprise a complete cDNA or gene 
sequence. Generally, a reference sequence is at least 20 nucleotides in length, 

30 frequently at least 25 nucleotides in length, and often at least 50 nucleotides in 
length. Since two polynucleotides may each (1) comprise a sequence (i.e., a 
portion of the complete polynucleotide sequence) that is similar between the two 
polynucleotides, and (2) may further comprise a sequence that is divergent 

10 
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between the two polynucleotides, sequence comparisons between two (or more) 
polynucleotides are typically performed by comparing sequences of the two 
polynucleotides over a "comparison window" to identify and compare local 
regions of sequence similarity. 
5 A "comparison window", as used herein, refers to a conceptual segment 

of at least 20 contiguous nucleotides and wherein the portion of the 
polynucleotide sequence in the comparison window may comprise additions or 
deletions (i.e., gaps) of 20 percent or less as compared to the reference sequence 
(which does not comprise additions or deletions) for optimal alignment of the 

10 two sequences. Optimal aligiunent of sequences for aligning a comparison 
window inay be conducted by the local homology algorithm of Smith and 
Waterman (1981) Adv Appl. Math. 2: 482, by the homology alignment 
algorithm of Needleman and Wunsch (1970) T. Mol. Biol. 4&: 443, by the search 
for similarity method of Pearson and Lipman (1988) Proc. Natl . Acad. Sci. 

15 (U.S.A. ) S5: 2444, by computerized implementations of these algorithms (GAP, 
BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package 
Release 7.0, Genetics Computer Group, 575 Science Dr., Madison, Wis.), or by 
inspection, and the best alignment (i.e., resulting in the highest percentage of 
homology over the comparison window) generated by the various methods is 

20 selected. 

The term "sequence identity" means that two polynucleotide sequences 
are identical (i.e., on a nucleotide-by-nucleotide basis) over the window of 
comparison. The term "percentage of sequence identity" means that two 
polynucleotide sequences are identical (i.e., on a nucleotide-by-nucleotide basis) 

25 over the window of comparison. The term "percentage of sequence identity" is 
calculated by comparing two optimally aligned sequences over the window of 
comparison, determining the number of positions at which the identical nucleic 
acid base (e.g.. A, T, C, G, U, or I) occurs in both sequences to yield the number 
of matched positions, dividing the number of matched positions by the total 

30 number of positions in the window of comparison (i.e., the window size), and 
multiplying the result by 100 to yield the percentage of sequence identity. The 
terms "substantial identity" as used herein denote a characteristic of a 
polynucleotide sequence, wherein the polynucleotide comprises a sequence that 
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has at least 85 percent sequence identity, preferably at least 90 to 95 percent 
sequence identity, more usually at least 99 percent sequence identity as 
compared to a reference sequence over a comparison window of at least 20 
nucleotide positions, frequently over a window of at least 20-50 nucleotides, 
5 wherein the percentage of sequence identity is calculated by comparing the 
reference sequence to the polynucleotide sequence which may include deletions 
or additions which total 20 percent or less of the reference sequence over the 
window of comparison. 

As applied to polypeptides, the term "substantial identity'' means that two 

10 peptide sequences, when optimally aligned, such as by the programs GAP or 
BESTFIT using default gap weights, share at least about 80 percent sequence 
identity, preferably at least about 90 percent sequence identity, more preferably 
at least about 95 percent sequence identity, and most preferably at least about 99 
percent sequence identity. 

15 As used herein, "substantially pure" means an object species is the 

predominant species present (i.e., on a molar basis it is more abundant than any 
other individual species in the composition), and preferably a substantially 
purified fraction is a composition wherein the object species comprises at least 
about 50 percent (on a molar basis) of all macromolecular species present. 

20 Generally, a substantially pure composition will comprise more than about 80 
percent of all macromolecular species present in the composition, more 
preferably more than about 85 %, about 90 %, about 95 %, and about 99 %. 
Most preferably, the object species is purified to essential homogeneity 
(contaminant species cannot be detected in the composition by conventional 

25 detection methods) wherein the composition consists essentially of a single 
macromolecular species. 

An isolated "variant" TLR4 polypeptide has at least 50 %, preferably at 
least about 80 %, and more preferably at least about 90 %, but less than 100 %, 
contiguous amino acid sequence homology or identity to the amino acid 

30 sequence of a reference (wild-type) TLR4 polypeptide. Preferably, the TLR4 
polypeptides of the invention are biologically active. Biologically active 
polypeptides include those that induce an immune response when administered 
to an organism, are bound by antibodies specific for TLR4, activate signaling 
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through NF-KB and AP-1, interact with CD 14, or induce cytokine release 
following LPS stimulation. While it is preferred that a variant TLR4 has at least 
about 0.1%, preferably at least about 1%, and more preferably at least about 
10%, of the activity of wild-type TLR4, the invention includes variant TLR4 
5 polypeptides having no detectable biological activity. Likewise, a "variant" 
TLR4 nucleic acid molecule has at least about 80%, preferably at least about 
90% and more preferably at least about 95%, but less than 100% contiguous 
nucleic acid sequence homology or identity to the nucleic acid sequence of a 
wild-type TLR4 gene. 

10 As used herein, an "indication or condition associated with aberrant, 

modified or altered innate immunity" includes, but is not limited to, 
hyporesponsiveness to LPS, susceptibility to infection with gram-negative 
bacteria, susceptibility to sepsis by gram-negative bacteria, susceptibility to 
chronic airway disease, susceptibility to asthma, susceptibility to arthritis, 

15 susceptibility to pyelonephritis, susceptibility to gall bladder disease, 

susceptibility to pneumonia, susceptibility to bronchitis, susceptibility to chronic 
obstructive pulmonary disease, severity of cystic fibrosis, and susceptibility to 
local and systemic inflammatory conditions, e.g., systemic inflammatory 
response syndrome (SIRS), local gram negative bacterial infection, or acute 

20 respiratory distress syndrome (ARDS). 

A. Nucleic Acid Molecules of the Invention 

1 . Sources of the Nucleic Acid Molecules of the Invention 

Sources of nucleotide sequences from which the present nucleic acid 

25 molecules encoding TLR4, a portion (firagment) thereof, a variant thereof or the 
nucleic acid complement thereof, include total or poly A* RNA fi-om any 
mammalian, preferably human, cellular source from which cDNAs can be 
derived by methods known in the art. Other sources of the DNA molecules of 
the invention include genomic libraries derived fi-om any mammalian cellular 

30 source. Moreover, the present DNA molecules may be prepared in vitro, e.g., by 
synthesizing an oligonucleotide of about 100, preferably about 75, more 
preferably about 50, and even more preferably about 40, nucleotides in length, or 
by subcloning a portion of a DNA segment that encodes a particular TLR4. 
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2, fsolation of a Gene Encoding TI.R4 

A nucleic acid molecule encoding TLR4 can be identified and isolated 
using standard methods, as described by Sambrook et al.. Molecular Cloning : A 
Laboratory Manual , Cold Spring Harbor, NY (1989). For example, reverse- 
5 transcriptase PCR (RT-PCR) can be employed to isolate and clone TLR4 
cDNAs. Oligo-dT can be employed as a primer in a reverse transcriptase 
reaction to prepare first-strand cDNAs from isolated RNA which contains RNA 
sequences of interest, e.g., total RNA isolated from human tissue. RNA can be 
isolated by methods known to the art, e.g., using TRIZOL"** reagent (GIBCO- 

10 BRL/Life Technologies, Gaithersburg, MD). Resultant first-strand cDNAs are 
then amplified in PCR reactions. 

"Polymerase chain reaction" or "PCR" refers to a procedure or technique 
in which amounts of a preselected fragment of nucleic acid, RNA and/or DNA, 
are amplified as described in U.S. Patent No. 4,683,195. Generally, sequence 

1 5 information from the ends of the region of interest or beyond is employed to 
design oligonucleotide primers comprising at least 7-8 nucleotides. These 
primers will be identical or similar in sequence to opposite strands of the 
template to be amplified. PCR can be used to amplify specific RNA sequences, 
specific DNA sequences from total genomic DNA, and cDNA transcribed from 

20 total cellular RNA, bacteriophage or plasmid sequences, and the like. See 

generally MuUis et al.. Cold Spring Harbor Symp. Quant, Biol, , 11, 263 (1987); 
Eriich, ed., PCR Technology , (Stockton Press, NY, 1989). Thus, PCR-based 
cloning approaches rely upon conserved sequences deduced from alignments of 
related gene or polypeptide sequences. 

25 Primers are made to correspond to highly conserved regions of 

polypeptides or nucleotide sequences which were identified and compared to 
generate the primers, e.g., by a sequence comparison of other mammalian TLR4. 
One primer is prepared which is predicted to anneal to the antisense strand, and 
another primer prepared which is predicted to anneal to the sense strand, of a 

30 DNA molecule which encodes TLR4. 

The products of each PCR reaction are separated via an agarose gel and 
all consistently amplified products are gel-purified and cloned directly into a 
suitable vector, such as a known plasmid vector. The resultant plasmids are 
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subjected to restriction endonuclease and dideoxy sequencing of double-stranded 
plasmid DNAs. 

Another approach to identify, isolate and clone cDNAs which encode 
TLR4 is to screen a cDNA or genomic library. Screening for DNA fragments 
5 that encode all or a portion of a DNA encoding TLR4 can be accomplished by 
probing the library with a probe which has sequences that are highly conserved 
between genes believed to be related to TLR4, e.g., the homolog of a particular 
TLR4 from a different species, or by screening of plaques for binding to 
antibodies that specifically recognize TLR4. DNA fragments that bind to a 

1 0 probe having sequences which are related to TLR4, or which are 

immunoreactive with antibodies to TLR4, can be subcloned into a suitable vector 
and sequenced and/or used as probes to identify other cDNAs encoding ail or a 
portion of TLR4. 

Thus, "isolated and/or purified TLR4*' nucleic acid refers to in vitro 

15 isolation of a nucleic acid molecule from its natural cellular environment, and 
from association with other components of the cell, such as nucleic acid or 
polypeptide, so that it can be sequenced, replicated, and/or expressed. For 
example, "isolated TLR4 nucleic acid" is RNA or DNA containing greater than 
9, preferably 36, and more preferably 45 or more, sequential nucleotide bases 

20 that encode at least a portion of TLR4, a variant thereof, RNA or DNA 

complementary thereto, or which hybridizes to, RNA or DNA comprising TLR4 
sequences, and remains stably bound under stringent conditions, as defined by 
methods well known in the art, e.g., in Sambrook et al., supra. Thus, the RNA 
or DNA is "isolated" in that it is free from at least one contaminating nucleic 

25 acid with which it is normally associated in the natural source of the RNA or 
DNA and is preferably substantially free of any other mammalian RNA or DNA. 
The phrase "free from at least one contaminating source nucleic acid with which 
it is normally associated" includes the case where the nucleic acid is reintroduced 
into the source or natural cell but is in a different chromosomal location or is 

30 otherwise flanked by nucleic acid sequences not normally found in the source 
cell. 

As used herein, the term "recombinant nucleic acid", e.g., "recombinant 
DNA sequence or segment", refers to a nucleic acid, e.g., to DNA, that has been 
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derived or isolated from any appropriate tissue source, that may be subsequently 
chemically altered /// vitro, so that its sequence is not naturally occurring, or 
corresponds to naturally occurring sequences that are not positioned as they 
would be positioned in a genome which has not been transformed with 
5 exogenous DNA. An example of DNA "derived" from a source, would be a 
DNA sequence that is identified as a useful fragment within a given organism, 
and which is then chemically synthesized in essentially pure form. An example 
of such DNA "isolated" from a source would be a useful DNA sequence that is . 
excised or removed from said source by chemical means, e.g., by the use of 

10 restriction endonucleases, so that it can be further manipulated, e.g., amplified, 
for use in the invention, by the methodology of genetic engineering. 

Thus, recovery or isolation of a given fi-agment of DNA from a restriction 
digest can employ separation of the digest on polyacrylamide or agarose gel by 
electrophoresis, identification of the fragment of interest by comparison of its 

15 mobility versus that of marker DNA fragments of known molecular weight, 

removal of the gel section containing the desired fragment, and separation of the 
gel from DNA. See Lawn et al.. Nucleic Acids Res. , 2, 6103 (1981), and 
Goeddel et al.. Nucleic Adds Res, , &, 4057 (1980). Therefore, "isolated DNA" 
includes completely synthetic DNA sequences, semi-synthetic DNA sequences, 

20 DNA sequences isolated from biological sources, and DNA sequences derived 
from RNA, as well as mixtures thereof 

As used herein, the term "derived" with respect to a RNA molecule 
means that the RNA molecule has complementary sequence identity to a 
particular DNA molecule. 

25 Variants of the Nucleic Acid M olecules of the Invention 

Nucleic acid molecules encoding amino acid sequence variants of TLR4 
are prepared by a variety of methods known in the art. These methods include, 
but are not limited to, isolation from a natural source (in the case of naturally 
occurring amino acid sequence variants) or preparation by oligonucleotide- 

30 mediated (or site-directed) mutagenesis, PGR mutagenesis, and cassette 
mutagenesis of an earlier prepared variant or a non- variant version of TLR4 
nucleic acid. 
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Oligonucleotide-mediated mutagenesis is a preferred method for 
preparing amino acid substitution variants of TLR4. This technique is well 
known in the art as described by Adelman et al., DNA, 2, 183 (1983). Briefly, 
TLR4 DNA is altered by hybridizing an oligonucleotide encoding the desired 
5 mutation to a DNA template, where the template is the single-stranded form of a 
plasmid or bacteriophage containing the unaltered or native DNA sequence of 
TLR4. After hybridization, a DNA polymerase is used to synthesize an entire 
second complementary strand of the template that will thus incorporate the 
oligonucleotide primer, and will code for the selected alteration in the TLR4 
10 DNA. 

Generally, oligonucleotides of at least 25 nucleotides in length are used. 
An optimal oligonucleotide will have 12 to 15 nucleotides that are completely 
complementary to the template on either side of the nucleotide(s) coding for the 
mutation. This ensures that the oligonucleotide will hybridize properly to the 

15 single-stranded DNA template molecule. The oligonucleotides are readily 

synthesized using techniques known in the art such as that described by Crea et 
al, Proc. Natl, Acad, Sci. U,S.A,> 25, 5765 (1978). 

The DNA template can be generated by those vectors that are either 
derived from bacteriophage Ml 3 vectors (the commercially available M13mpl8 

20 and M13mpl9 vectors are suitable), or those vectors that contain a single- 
stranded phage origin of replication as described by Viera et al., Meth. Rn73miol ., 
151, 3 (1987). Thus, the DNA that is to be mutated may be inserted into one of 
these vectors to generate single-stranded template. Production of the single- 
stranded template is described in Sections 4.21-4.41 of Sambrook et al., 

25 Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory 
Press, N.Y. 1989). 

Alternatively, single-stranded DNA template may be generated by 
denaturing double-stranded plasmid (or other) DNA using standard techniques. 
For alteration of the native DNA sequence (to generate amino acid 

30 sequence variants, for example), the oligonucleotide is hybridized to the single- 
stranded template under suitable hybridization conditions. A DNA polymerizing 
enzyme, usually the Klenow fragment of DNA polymerase I, is then added to 
synthesize the complementary strand of the template using the oligonucleotide as 
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a primer for synthesis. A heteroduplex molecule is thus formed such that one 
strand of DNA encodes the mutated form of TLR4, and the other strand (the 
original template) encodes the native, unaltered sequence of TLR4. This 
heteroduplex molecule is then transformed into a suitable host cell, usually a 
5 prokaryote such as £. coli JMl 01 . After the cells are grown, they are plated onto 
agarose plates and screened using the oligonucleotide primer radiolabeled with 
32-phosphate to identify the bacterial colonies that contain the mutated DNA, 
The mutated region is then removed and placed in an appropriate vector for 
peptide or polypeptide production, generally an expression vector of the type 
10 typically employed for transformation of an appropriate host. 

The method described immediately above may be modified such that a 
homoduplex molecule is created wherein both strands of the plasmid contain the 
mutations(s). The modifications are as follows: The single-stranded 
oligonucleotide is annealed to the single-stranded template as described above. 
1 5 A mixture of three deoxyribonucleotides, deoxyriboadenosine (dATP), 

deoxyriboguanosine (dGTP), and deoxyribothymidine (dTTP), is combined with 
a modified thiodeoxyribocytosine called dCTP-(aS) (which can be obtained 
from the Amersham Corporation). This mixture is added to the template- 
oligonucleotide complex. Upon addition of DNA polymerase to this mixture, a 
20 strand of DNA identical to the template except for the mutated bases is 

generated. In addition, this new strand of DNA will contain dCTP-(aS) instead 
of dCTP, which serves to protect it from restriction endonuclease digestion. 

After the template strand of the double-stranded heteroduplex is nicked 
with an appropriate restriction enzyme, the template strand can be digested with 
25 ExoIII nuclease or another appropriate nuclease past the region that contains the 
site(s) to be mutagenized. The reaction is then stopped to leave a molecule that 
is only partially single-stranded. A complete double-stranded DNA homoduplex 
is then formed using DNA polymerase in the presence of all four 
deoxyribonucleotide triphosphates, ATP, and DNA ligase. This homoduplex 
30 molecule can then be transformed into a suitable host cell such as E. coli JMlOl . 

For example, a preferred embodiment of the invention is an isolated and 
purified DNA molecule comprising a human DNA segment encoding a variant 
TLR4 having a nucleotide substitution at position 896 (A896G) which encodes 
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an amino acid substitution (Asp299Gly). Other nucleotide substitutions which 
result in silent mutations, missense mutations, or a nonsense mutations, can be 
ascertained by reference to Figure 7, Figure 8 and page Dl in Appendix D in 
<;anihrnnk et aK, Molecular Cloning! A T.ahnratnry Manual (1989). 

5 R rhimeric Expression Cassettes 

To prepare expression cassettes for transformation herein, the 
recombinant DNA sequence or segment may be circular or linear, double- 
stranded or single-stranded, A DNA sequence which encodes an RNA sequence 
that is substantially complementary to a mRNA sequence encoding TLR4 is 

1 0 typically a "sense" DNA sequence cloned into a cassette in the opposite 

orientation (i.e., 3' to 5' rather than 5' to 3'). Generally, the DNA sequence or 
segment is in the form of chimeric DNA, such as plasmid DNA, that can also 
contain coding regions flanked by control sequences which promote the 
expression of the DNA present in the resultant cell line. 

1 5 As used herein, "chimeric*' means that a vector comprises DNA from at 

least two different species, or comprises DNA from the same species, which is 
linked or associated in a manner which does not occur in the "native" or wild 
type of the species. 

Aside from DNA sequences that serve as transcription units for TLR4, or 

20 portions thereof, a portion of the recombinant DNA may be untranscribed, 

serving a regulatory or a structural function. For example, the DNA may itself 
comprise a promoter that is active in mammalian cells, or may utilize a promoter 
already present in the genome that is the transformation target. Such promoters 
include the CMV promoter, as well as the SV40 late promoter and retroviral 

25 LTRs (long terminal repeat elements), although many other promoter elements 
well known to the art may be employed in the practice of the invention. 

Other elements functional in the host cells, such as introns, enhancers, 
polyadenylation sequences and the like, may also be a part of the DInJA. Such 
elements may or may not be necessary for the function of the DNA, but may 

30 provide improved expression of the DNA by affecting transcription, stability of 
the mRNA, or the like. Such elements may be included in the DNA as desired to 
obtain the optimal performance of the transforming DNA in the cell. 
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"Control sequences" is defined to mean DNA sequences necessary for the 
expression of an operably linked coding sequence in a particular host organism. 
The control sequences that are suitable for prokaryotic cells, for example, 
include a promoter, and optionally an operator sequence, and a ribosome binding 
5 site. Eukaryotic cells are known to utilize promoters, polyadenylation signals, 
and enhancers. 

"Operably linked" is defined to mean that the nucleic acids are placed in 
a functional relationship with another nucleic acid sequence. For example, DNA 
for a presequence or secretory leader is operably linked to DNA for a peptide or 

10 polypeptide if it is expressed as a preprotein that participates in the secretion of 
the peptide or polypeptide; a promoter or enhancer is operably linked to a coding 
sequence if it affects the transcription of the sequence; or a ribosome binding site 
is operably linked to a coding sequence if it is positioned so as to facilitate 
translation. Generally, "operably linked" means that the DNA sequences being 

1 5 linked are contiguous and, in the case of a secretory leader, contiguous and in 
reading phase. However, enhancers do not have to be contiguous. Linking is 
accomplished by ligation at convenient restriction sites. If such sites do not 
exist, the synthetic oligonucleotide adaptors or linkers are used in accord with 
conventional practice. 

20 The DNA to be introduced into the cells further will generally contain 

either a selectable marker gene or a reporter gene or both to facilitate 
identification and selection of transformed cells from the population of cells 
sought to be transformed. Alternatively, the selectable marker may be carried on 
a separate piece of DNA and used in a co-transformation procedure. Both 

25 selectable markers and reporter genes may be flanked with appropriate 

regulatory sequences to enable expression in the host cells. Useful selectable 
markers are well known in the art and include, for example, antibiotic and 
herbicide-resistance genes, such as neo, hpt, dhfr, bar, aroA, dapA apd the like. 
See also, the genes listed on Table 1 of Lundquist et al. (U.S. Patent No. 

30 5,848,956). 

Reporter genes are used for identifying potentially transformed cells and 
for evaluating the functionality of regulatory sequences. Reporter genes which 
encode for easily assayable proteins are well known in the art. In general, a 

20 
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reporter gene is a gene which is not present in or expressed by the recipient 
organism or tissue and which encodes a protein whose expression is manifested 
by some easily delectable property, e.g., enzymatic activity. Preferred genes 
include the chloramphenicol acetyl transferase gene (cat) from Tn9 of £. coli, the 

5 beta-glucuronidase gene {gus) of the uidA locus of E, colU and the luciferase luc 
gene from firefly Photimis pyralis. Expression of the reporter gene is assayed at 
a suitable time after the DNA has been introduced into the recipient cells. 

The general methods for constructing recombinant DNA which can 
transform target cells are well known to those skilled in the art, and the same 

10 compositions and methods of construction may be utilized to produce the DNA 
useful herein. For example, J. Sambrook et al.. Molecular Cloning: A 
I.ahoratory Manual , Cold Spring Harbor Laboratory Press (2d ed., 1989), 
provides suitable methods of construction. 
C. Transformatio n into Host Cells 

15 The recombinant DNA can be readily introduced into the host cells, e.g., 

mammalian, bacterial, yeast or insect cells by transfection with an expression 
vector comprising DNA encoding TLR4 or its complement, by any procedure 
useful for the introduction into a particular cell, e.g., physical or biological 
methods, to yield a transformed cell having the recombinant DNA stably 

20 integrated into its genome, so that the DNA molecules, sequences, or segments, 
of the present invention are expressed by the host cell. 

Physical methods to introduce a DNA into a host cell include calcium 
phosphate precipitation, lipofection, particle bombardment, microinjection, 
electroporation, and the like. Biological methods to introduce the DNA of 

25 interest into a host cell include the use of DNA and RNA viral vectors. The 
main advantage of physical methods is that they are not associated with 
pathological or oncogenic processes of viruses. However, they are less precise, 
often resulting in multiple copy insertions, random integration, disruption of 
foreign and endogenous gene sequences, and unpredictable expression. Viral 

30 vectors, useful to introduce genes to mammalian cells include, but are not limited 
to, poxvirus vectors, herpes simplex virus I vectors, adenovirus vectors, adeno- 
associated virus vectors, and the like. 
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As used herein, the term "cell line" or "host cell" is intended to refer to 
well-characterized homogenous, biologically pure populations of cells. These 
cells may be eukaryotic cells that are neoplastic or which have been 
"immortalized" in vitro by methods known in the art, as well as primary cells, or 
5 prokaryotic cells. The cell line or host cell is preferably of mammalian origin, 
but cell lines or host cells of non-mammalian origin may be employed, including 
plant, insect, yeast, fungal or bacterial sources. Generally, the DNA sequence is 
related to a DNA sequence which is resident in the genome of the host cell but is 
not expressed, or not highly expressed, or, alternatively, overexpressed. 

10 "Transfected" or "transformed" is used herein to include any host cell or 

cell line, the genome of which has been altered or augmented by the presence of 
at least one DNA sequence, which DNA is also referred to in the art of genetic 
engineering as "heterologous DNA," "recombinant DNA," "exogenous DNA," 
"genetically engineered," "non-native," or "foreign DNA," wherein said DNA 

1 S was isolated and introduced into the genome of the host cell or cell line by the 
process of genetic engineering. The host cells of the present invention are 
typically produced by transfection with a DNA sequence in a plasmid expression 
vector, a viral expression vector, or as an isolated linear DNA sequence. 

To confirm the presence of the DNA sequence in the host cell, a variety 

20 of assays may be performed. Such assays include, for example, "molecular 
biological" assays well known to those of skill in the art, such as Southern and 
Northem blotting, RT-PCR and PGR; "biochemical" assays, such as detecting 
the presence or absence of a particular TLR4, e.g., by immunological means 
(ELIS As and Western blots) or by assays described herein. 

25 To detect and quantitate RNA produced from introduced DNA segments, 

RT-PCR may be employed. In this application of PCR, it is first necessary to 
reverse transcribe RNA into DNA, using enzymes such as reverse transcriptase, 
and then through the use of conventional PCR techniques amplify the DNA, In 
most instances PCR techniques, while useful, will not demonstrate integrity of 

30 the RNA product. Further information about the nature of the RNA product may 
be obtained by Northem blotting. This technique demonstrates the presence of 
an RNA species and gives information about the integrity of that RNA. The 
presence or absence of an RNA species can also be determined using dot or slot 
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blot Northern hybridizations. These techniques are modifications of Northern 
blotting and only demonstrate the presence or absence of an RNA species. 

While Southern blotting and PGR may be used to detect the DNA 
segment in question, they do not provide information as to whether the DNA 
5 segment is being expressed. Expression may be evaluated by specifically 
identifying the polypeptide products of the introduced DNA sequences or 
evaluating the phenotypic changes brought about by the expression of the 
introduced DNA segment in the host cell. 
D. TT.R4 polypeptides, variants, and derivatives thereof 

1 0 The present isolated, purified TLR4 polypeptides, variants or derivatives 

thereof, can be synthesized in vitro, e.g., by the solid phase peptide synthetic 
method or by recombinant DNA approaches (see above). The solid phase 
peptide synthetic method is an established and widely used method, which is 
described in the following references: Stewart et ah. Solid Phase Peptide 

15 Synthesis , W. H. Freeman Co., San Francisco (1969); Merrifield, J, Am. Chem. 
Soc, &5 2149 (1963); Meienhofer in "Hormonal Proteins and Peptides," ed.; 
C.H. Li, Vol. 2 (Academic Press, 1973), pp. 48-267; Bavaay and Merrifield, 
"The Peptides," eds. E. Gross and F. Meienhofer, Vol. 2 (Academic Press, 1980) 
pp. 3-285; and Clark-Lewis et al., Meth. FnzymoL , 2S2, 233 (1997). These 

20 peptides can be further purified by fractionation on immunoaffinity or ion- 
exchange columns; ethanol precipitation; reverse phase HPLC; chromatography 
on silica or on an anion-exchange resin such as DEAE; chromatofocusing; SDS- 
PAGE; ammonium sulfate precipitation; gel filtration using, for example, 
Sephadex G-75; or ligand affinity chromatography. 

25 Once isolated and characterized, derivatives, e.g., chemically derived 

derivatives, of a given TLR4 can be readily prepared. For example, amides of 
the TLR4 or TLR4 variants of the present invention may also be prepared by 
techniques well known in the art for converting a carboxylic acid group or 
precursor, to an amide. A preferred method for amide formation at the C- 

30 terminal carboxyl group is to cleave the peptide from a solid support with an 
appropriate amine, or to cleave in the presence of an alcohol, yielding an ester, 
followed by aminolysis with the desired amine. 
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Salts of carboxyl groups of a peptide or peptide variant of the invention 
may be prepared in the usual manner by contacting the peptide with one or more 
equivalents of a desired base such as, for example, a metallic hydroxide base, 
e.g., sodium hydroxide; a metal carbonate or bicarbonate base such as, for 

5 example, sodium carbonate or sodium bicarbonate; or an amine base such as, for 
example, triethylamine, triethanolamine, and the like. 

N-acyl derivatives of an amino group of the TLR4 or variant thereof may 
be prepared by utilizing an N-acyl protected amino acid for the final 
condensation, or by acylating a protected or unprotected peptide. 0-acyl 

10 derivatives may be prepared, for example, by acylation of a free hydroxy peptide 
or peptide resin. Either acylation may be carried out using standard acylating 
reagents such as acyl halides, anhydrides, acyl imidazoles, and the like. Both N- 
and 0-acylation may be carried out together, if desired. 

Formyl-methionine, pyroglutamine and trimethyl-alanine may be 

15 substituted at the N-terminal residue of the peptide or peptide variant. Other 
amino-terminal modifications include aminooxypentane modifications (see 
Simmons et al.. Science, 226, 276 (1997)). 

In addition, the amino acid sequence of TLR4 can be modified so as to 
result in a variant TLR4. The modification includes the substitution of at least 

20 one amino acid residue in the polypeptide for another amino acid residue, 

including substitutions which utilize the D rather than L form, as well as other 
well known amino acid analogs, e.g., unnatural amino acids such as a, a- 
disubstituted amino acids, N-alkyl amino acids, lactic acid, and the like. These 
analogs include phosphoserine, phosphothreonine, phosphotyrosine, 

25 hydroxyproline, gamma-carboxyglutamate; hippuric acid, octahydroindole-2- 
carboxylic acid, stating, l,2,3,4,-tetrahydroisoquinoline-3-carboxylic acid, 
penicillamine, ornithine, citrulline, a-methyl-alanine, para-benzoyl- 
phenylalanine, phenylglycine, propargylglycine, sarcosine, e-N,N,N- 
trimethyllysine, e-N-acetyllysine, N-acetylserine, N-formylmethionine, 3- 

30 methylhistidine, 5-hydroxylysine, a)-N-methylarginine, and other similar amino 
acids and imino acids and tert-butylglycine. 

Conservative amino acid substitutions include aspartic-glutamic as acidic 
amino acids; lysine/arginine/histidine as basic amino acids; leucine/isoleucine, 
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methionine/valine, alanine/valine as hydrophobic amino acids; 
serine/glycine/alanine/threonine as hydrophilic amino acids. Conservative 
amino acid substitution also includes groupings based on side chains. For 
example, a group of amino acids having aliphatic side chains is glycine, alanine, 
5 valine, leucine, and isoleucine; a group of amino acids having aliphatic-hydroxyl 
side chains is serine and threonine; a group of amino acids having amide- 
containing side chains is asparagine and glutamine; a group of amino acids 
having aromatic side chains is phenylalanine, tyrosine, and tryptophan; a group 
of amino acids having basic side chains is lysine, arginine, and histidine; and a 

10 group of amino acids having sulfur-containing side chains is cysteine and 

methionine. For example, it is reasonable to expect that replacement of a leucine 
with an isoleucine or valine, an aspartate with a glutamate, a threonine with a 
serine, or a similar replacement of an amino acid with a structurally related 
amino acid will not have a major effect on the properties of the resulting variant 

15 polypeptide. Whether an amino acid change results in a functional polypeptide 
can readily be determined by assaying the activity of the polypeptide variant. 
Such assays are described herein. 

Conservative substitutions are shown in Figure 8 under the heading of 
exemplary substitutions. More preferred substitutions are under the heading of 

20 preferred substitutions. After the substitutions are introduced, the variants are 
screened for biological activity. 

Amino acid substitutions falling within the scope of the invention, are, in 
general, accomplished by selecting substitutions that do not differ significantly 
in their effect on maintaining (a) the structure of the peptide backbone in the area 

25 of the substitution, (b) the charge or hydrophobicity of the molecule at the target 
site, or (c) the bulk of the side chain. Naturally occurring residues are divided 
into groups based on common side-chain properties: 

(1) hydrophobic: norleucine, met, ala, val, leu, ile; 

(2) neutral hydrophilic: cys, ser, thr; 
30 (3) acidic: asp, glu; 

(4) basic: asn, gin, his, lys, arg; 

(5) residues that influence chain orientation: gly, pro; and 

(6) aromatic; trp, tyr, phe. 
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The invention also envisions polypeptide variants with non-conservative 
substitutions. Non-conservative substitutions entail exchanging a member of 
one of the classes described above for another. 

Acid addition salts of the polypeptide or variant polypeptide or of amino 
5 residues of the polypeptide or variant polypeptide may be prepared by contacting 
the polypeptide or amine with one or more equivalents of the desired inorganic 
or organic acid, such as, for example, hydrochloric acid. Esters of carboxyl 
groups of the polypeptides may also be prepared by any of the usual methods 
known in the art. 

10 F Methods of the Invention 

The invention provides a method to diagnose individuals who are at a 
greater risk of deleterious consequences due to bacterial infection, e.g., the 
individuals may be more susceptible to infection by gram-negative bacteria, e.g., 
local gram-negative infection, more susceptible to sepsis induced by gram- 

15 negative bacteria, more susceptible to chronic airway disease, more susceptible 
to asthma, more susceptible to gall bladder disease, more susceptible to 
pyelonephritis, more susceptible to pneumonia, more susceptible to bronchitis, 
more susceptible to chronic obstructive pulmonary disease, more susceptible to 
arthritis, at higher risk for severe cystic fibrosis, and more susceptible to local 

20 and systemic inflammatory conditions such as systemic inflammatory response 
syndrome (SIRS), and acute respiratory distress syndrome (ARDS). The 
invention is also useful in the development of drugs that target the TLR4 gene 
product, e.g., increase or decrease the function of TLR4, especially the 
extracellular domain. These agents may thus be useful to prevent or ameliorate 

25 infection by gram-negative bacteria, prevent or ameliorate sepsis induced by 
gram-negative bacteria, prevent or ameliorate LPS-induced chronic airway 
disease in normal, cystic fibrosis and asthmatic populations, prevent or 
ameliorate arthritis, and prevent or ameliorate local and systemic inflammatory 
conditions such as SIRS and ARDS, particularly in individuals at risk for these 

30 indications or conditions. 

The invention will be further described by the following non-limiting 
examples. 
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Kxamp le 1 

Methods 

Study Subjects . The study population consisted of 83 healthy adult volunteers 
(31 men, 52 women) aged 18-50. Exclusion criteria included any history of 
5 allergies, tobacco use, cardiac or pulmonary disease. After written informed 
consent was obtained, all subjects were screened with spirometry, inhalation 
challenge with histamine, skin testing for common aeroallergens, chest x-ray, 
and electrocardiogram. All participants had normal screening studies (including 
histamine PCjo > 32 mg/ml), were on no medications (except birth control), and 
10 had no significant acute or chronic cardiopulmonary disease or occupational 
exposures. 

Tnhalatinn Challenge Protocol . All subjects were exposed by inhalation 
challenge to buffered sterile saline (HBSS) followed by increasing 
concentrations of LPS. The solutions were delivered via a DeVilbiss 646 

1 5 nebulizer powered by compressed air at 30 psi (DeVilbiss Co., Somerset, PA) 
and a Rosenthal dosimeter (Laboratory for Applied Immunology, Baltimore, 
MD). After the HBSS, subsequent inhalations delivered in increasing doses of 
LPS according to the following schedule: 0.5 ^g, 1 .0 ^ig, 2.0 ng, 3.0 |ig, 5.0 fig, 
10 \ig, and 20 ^g. Thus, the entire protocol delivered a total of 4L5 ^g of LPS. 

20 Incremental LPS inhalation challenge . The incremental LPS inhalation challenge 
was performed as follows. The % decline in FEV,/^g LPS was calculated 
following administration of the cumulative LPS dose that either resulted in at 
least a 20% decline in FEV, or the decline in FEV, following a cumulative 
inhaled dose of 41 .5 fig of LPS. Subjects above the x-axis (solid bars in Figure 

25 5) are homozygous for the wild type allele (WT/WT); subjects below the x-axis 
(open bars in Figure 5) are either heterozygous or homozygous (*) for the 
missense Asp299Gly allele. The data is replotted in the inset after the values for 
the dose-response slope were log normalized. P values are presented for the 
comparison of the % decline in FEV, /fig LPS between subjects with the WTAVT 

30 genotype (N=73) and those with at least one Asp299Gly allele (N=10) using 
absolute values (P=0.037) and log normalized values (P=0.026). Since the 
distribution of the dose-response slope (% decline FEV, /cumulative dose of 
inhaled LPS) was highly skewed, the two-sample Monte-Carlo permutation test 
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based on 10,000 permutations was used to calculate P values (Fisher et al., 
1993). To assess the allelic frequency of TLR4 sequence variants, a well- 
characterized Iowa population (Lidral et al., 1998) and the Centre d'Etude du 
Polymorphisme Humain (CEPH) population (NIH-CEPIT, 1992) were screened 
5 for specific sequence variants identified in the 83 study subjects, 

Fndotoxin , Solutions of endotoxin for inhalation were prepared according to a 
standard protocol using lyophilized Escherichia coli (serotype 0111 :B4, Sigma 
Chemical Co., St. Louis, MO) LPS. These solutions of LPS were resuspended in 
sterile Hank's balanced salt solution (without calcium or magnesium) at a pH of 
10 7.0 and filter sterilized. All solutions used for inhalation were tested for sterility 
(bacteria and fungi) and LPS content {Limulus amebocyte lysate assay, QCL- 
1000; Whittaker Bioproducts, Walkersville, MD) prior to separation into 
individual aliquots. These aliquots were stored immediately after preparation at 
-l(fC until used. 

15 Physiolngi c Measurements . A Spirotech (Atlanta, GA) S600 spirometer was 
used to assess pulmonary function; spirometry was performed using standards 
established by the American Thoracic Society. Subjects were positioned upright 
in a chair and were using noseclips. Baseline spirometry was recorded after 
inhalation of saline, and then 1, 10, 20, and 30 minutes following inhalation of 

20 each dose of LPS, and compared with the post-saline baseline spirometry. If the 
study subject's FEVj was greater than 80 % of the baseline measurement at the 
final assessment (30 minutes post-saline), the inhalation challenge was continued 
and the next does of LPS was administered. The challenge test was terminated 
when any of the following criteria had been met: 1) the subject did not wish to 

25 continue for any reason; 2) the subject's FEV, decreased 20 % or greater from 
baseline; or 3) a cumulative dose of 4L5 \ig had been achieved. Of 84 subjects 
enrolled in the study, 1 subject withdrew prior to completion of the LPS 
inhalation challenge test, 52 subjects had at least a 20 % decline in the FEV, 
during the LPS inhalation challenge test, and 31 subjects inhaled a cumulative 

30 dose of 41.5 jig of LPS and did not decrease their FEV, by 20 %. 

Assignme nt of Phenntype . Study subjects were categorized as either 
"responsive" or "hyporesponsive" to inhaled LPS. In the course of previous 
investigations, a large number of study subjects have been exposed to inhaled 
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LPS (Jagielo et al., 1996; Deetz et al., 1997). In general, most healthy non- 
asthmatic study subjects develop airflow obstruction (FEV, < 80 % of the pre- 
exposure value) when challenged with approximately 40 ^g of LPS. Based on 
this and a standard approach to the definition of airway hyperreactivity (FEV, 

5 decline of 20 % from pre-exposure values) (Chai et aL, 1975), subjects were 
categorized as "LPS responsive" if they decreased their FEV, by at least 20 % at 
any point during the LPS inhalation challenge, or "hyporesponsive" if they had < 
20 % decline in their FEV, after irJiahng a cumulative does of 4L5 |ig of LPS. 
Lsolation of Genomic Clone . A human bacterial artificial chromosomal (BAC) 

10 library (Research Genetics; Huntsville, AL) was screened with two sets of 
primers (IF: 5'ATGGGGCATATCAGAGCCTA 3; SEQ ID N0:8, IR: 
5'GTCCAATGGGGAAGTTCTCT 3; SEQ ID NO:9, 2F: 
5TCATTGTCCTGCAGAAGGTG 3; SEQ ID NO: 10, and 2R: 
5*CAGGGCTTTTCTGAGTCGTC 3; SEQ ID N0:1 1) derived from the human 

1 5 TLR4 gene (Genbank Accession Nos. U88880 and U9309 1 ). These sets of 
primers amplified a 160 bp and 140 bp product, respectively. PGR reactions 
were prepared by combining the following components: I \il of PGR lOX buffer 
(100 mM Tris-HCl, pH 8.3, 500 mM KCl), 200 ^M each of dCTP, dGTP, dATP, 
and dTTP, 0.25 ^iM of each primer, 0.2 U of AmpliTaq DNA polymerase (Perkin 

20 Elmer; Norwalk, CT) and 1 |il of the library sample in a final volume of 10 ^1. 
Thermal cycling was performed with an initial denaturation at 94°C for 3 
minutes followed by 35 cycles comprising 94°C, 55°C, and 72^C steps of 30 
seconds each and a final extension of 72°C for 5 minutes. PGR products were 
separated by electrophoresis on 2% agarose gels, stained with ethidium bromide 

25 and visualized under UV light. 

Mutation Detection . Genomic DNA was isolated from whole blood obtained 
from the study subjects using a rapid salt isolation procedure (Laitinen et al., 
1994). Overlapping primer sets were designed across the coding sequence such 
that products did not exceed 250 bp (Table 1). Primers were derived from 

30 flanking intronic sequences to include all splice sites. PGR reactions were 

prepared as described above except that 10-20 ng of genomic DNA was used at 
template. Amplification products were separated on non-denaturing, fan-cooled 
gels containing 5% acrylamide/bis (19:1), 0.5X TBE, and 2,5% glycerol for 3 
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hours at 20 W. A subset of PGR products were also run on MDE gels. The gels 
were subjected to silver staining and aberrant bands extracted from the gel, 
reamplified, and sequenced in both directions. To verify the sequence of the 
aberrant band, the same primers were used to amplify and sequence genomic 
5 DNA from each subject. At least one individual without the aberrant band was 
also sequenced for comparison. Amplification products derived from the 
extracted DNA were purified using spin columns (Qiagen; Santa Clarita, CA) 
and DNA concentration determined by spectrophotometry. The DNA sequence 
was determined with a Model 377 automated DNA sequencer (Perkin Elmer; 
10 Norwalk, CT). 
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Multi>tissue cDNA Expression Screen . Human adult (Clontech #K1420-1 ; Palo 
Alto, CA) and fetal (Clontech #K1425-1; Palo Alto, CA) multi-tissue cDNA 
panels were screened by PGR using primers derived from exon 1 (forward; 
5'GCTCACAGAAGCAGTGAGGA 3'; SEQ ID NO:60) and exon 4 (reverse; 
5 5TAGGCTCTGATATGCCCCAT3'; SEQ ID N0:6 1 ) of the human TLR4 gene. 
These PGR experiments were performed in 10 jil reactions composed of 0.025 
U/|il of DNA polymerase (BioXAGT; BioLine; Reno, NV), 1 .0 |iM of each 
primer, 200 ^iM of each dNTP, 1 ^1 of lOX buffer supplied by the manufacturer, 
and 1 [il of the cDNA sample. PGR conditions were: 95°G for 2 minutes; cycles 
10 of 0.5 minutes at 94°G, 0.5 minutes at 55°C, and 1 minute at 68°C; followed by a 
final extension step for 10 minutes at 68°C. The amplified products were 
separated on a 2% agarose gel, stained with ethidium bromide, and visualized 
under UV light. 

Statistical Analysis . The statistical analysis was designed to determine whether 
1 5 specific mutations in the TLR4 gene were associated with the airway 

hyporesponsiveness to inhaled LPS. A one-tailed test of statistical significance 
was employed (Fleiss, 1986). In a 2 X 2 analysis, a Fisher's one-tailed exact test 
was used to determine whether specific mutations of the TLR4 gene occurred 
more fi-equently in study subjects who were considered LPS hyporesponsive 
20 compared to those with a normal airway response to inhaled LPS. In addition, 
because of the nonparametric distribution of dose-response slope (percent 
decline FEV /cumulative dose of inhaled LPS), this outcome log was 
transformed and the Student's one-tailed T test (assuming unequal variances) 
was used to determine whether the dose-response slope was significantly less in 
25 subjects with a specific mutation of TLR4 compared to subjects with the 
common TLR4 allele. 

Bioassays . Cells were maintained in F12 media supplemented with 10% fetal 
calf serum (Gibco, Rockville, MD), 2 mM L-glutamine, and 10,000^units 
penicillin/streptomycin, with 1 fig/ml of G418 as selective antibiotic. Cells 
30 were transfected with a mixture of 4 \ig of DNA and 10 jil of Superfect (Qiagen, 
Valencia, CA) for a 35 mm^ dish. The DNA mix consisted of 2 \ig of DNA and 
10 ^ig NFkB reporter plasmid encoding for the luciferase gene (Clontech, Palo 
Alto, CA) and 1 jig of each TLR4 (Medzhitov et al., 1997; Genbank #U93091) 
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expression plasmid (WT or Asp299Gly). If only one expression plasmid was 
used, empty vector pcDNA3.1^ was added to keep the DNA concentration 
constant. Twenty-four hours later the cells were stimulated with 100 ng/ml of 
LPS for 6 hours. Total luciferase activity was measured using a commercially 
5 available method (Tropix, Bedford, MA). Briefly, after rising with PBS, cells 
were removed from filters by incubation with 120 fil lysis buffer (25 mM Tris- 
phosphate, pH 7.8; 2 mM DTT; 2 mM I, 2-diaminocyclohexane-N,N,N',N'- 
tetraacetic acid; 10% glycerol; and 1% Triton X-100) for 15 minutes. Light 
emission was quantified in a luminometer (Analytical Luminescence Laboratory, 

10 San Diego, CA). 

Airway epithelial cells were obtained from trachea and bronchi of lungs 
removed for organ donation. Cell were genotyped for TLR4 and isolated by 
enzyme digestion as previously described (Zabner et aL, 1996). Freshly isolated 
cells were seeded at a density of 5 x 10^ cells/cm^ onto collagen-coated, 0.6 cm^ 

15 diameter millicell polycarbonate filters (Millipore Corp., Bedford, MA). The 
cells were maintained at 37°C in a humidified atmosphere of 5% COj and air. 
Twenty-four hours after plating, the mucosal media was removed and the cells 
were allowed to grow at the air-liquid interface. The culture media consisted of 
a 1:1 mix of DMEM/Ham's F12, 5% Ultraser G (Biosepra SA, Cedex, France), 

20 100 U/ml penicillin, 100 \ig/m\ streptomycin, 1% nonessential amino acids, and 
0.12 U/ml insulin. Epithelia were tested for transepithelial resistance, and for 
morphology by scanning electron microscopy. Fourteen days after seeding, the 
basal release of IL-la was measured in WTAVT (12 specimens from 4 
individuals) and WT/Asp299Gly (24 specimens from 4 individuals) epithelia by 
/ 25 collecting the basolateral conditioned media after 24 hours (Figure 6b). The 

epithelia were then exposed to 100 ng/ml of LPS on the apical side for 6 hours, 
and the basolateral media was collected after 24 hours. IL-la was measured 
using a commercially available ELISA (R&D; Minneapolis, MN). 
To rescue the LPS hyporesponsive phenotype, heterozygote 

30 (WT/Asp299Gly) airway epithelia (Figure 6c) or homozygote 

(Asp299Gly/Asp299Gly) alveolar macrophages (Figure 6d) were transfected 
with a recombinant adenovirus vector expressing TLR4 (Rock et al. 1998; 
Genbank #U88880) that was prepared as described previously (Dandson et al., 
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1994) by the University of Iowa Gene Transfer Vector Core at titers of about 
10'^ infectious units (I.U.)/ml. Briefly TLR4 plasmid was blunt ended and 
cloned into the shuttle vector pAd5/CM Vk-NpA using the EcoRV site. Fourteen 
days after seeding of the airway epithelia (20 specimens from 4 individuals), 50 
5 MOI of the recombinant viruses (Ad/TLR4 and Ad/eGFP in phosphate buffered 
saline) were added to the basolateral surface of the epithelia for 30 minutes 
(Walters et al., 1999). After infection, the epithelia were incubated at 3TC for 
an additional 48 hours before the LPS stimulation assay. To assay for basal LPS 
response, the media was changed with fresh 500 (il of Ultroser G, and collected 

1 0 after 24 hours to measure the basal IL-1 a secretion to the basolateral side. After 
collecting the basal specimen, the epithelia were exposed to 100 ng/ml of LPS on 
the apical side for 6 hours, and the media was collected after 24 hours. To assay 
for gene transfer efficiency, the epithelia was associated with 0.05% trypsin and 
0.53 mM EDTA. Fluorescence from 50,000 individual cells was analyzed using 

15 fluorescence-activated cell analysis (FACScan, Lysys II software, Becton 

Dickinson, San Jose, CA). The percentage of GFP positive cells ranged between 
52% and 76%. Human alveolar macrophages were collected by BAL from a 
homozygote (Asp299Gly/Asp299Gly) study subject as previously described 
(Deetz et al., 1967). Macrophages were seeded onto a 96 well plate at a density 

20 of 1 0^ cells per well. The cells were infected after 4 hours with Ad/TLR4 in a 
CaPi coprecipitate (lasbinder et al, 1998) at an MOI of 50, and a Ca^ 
concentration of 1 6.8 mM. Sixteen hours after infection, the cells were exposed 
to LPS in 1% serum for 6 hours, and the media was collected. TNF-a was 
measured using a commercially available ELISA (R&D; Minneapolis, MN). 

25 Results 

Genomic Structure of the TLR4 Gene. To determine the genomic structure of 
the TLR4 gene, a bacterial artificial chromosome (BAG) library was screened by 
PGR with primers derived from the 5' and 3* ends of the cDNA sequence. 
Human BAG clone 439F3 was identified with both sets of primers and 
30 sequenced to identify exon/intro splice sites using the two published cDNA 
sequences (Medzhitov et al., 1997; Rock et al., 1998). The TLR4 cDNA 
published by Medzhitov et al. (1997) (Genbank Accession No. U93091) was 
missing a 120 bp sequence beginning at nucleotide 136 of the other published 
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cDNA (Rock et al., 1998) (Genbank Accession No. U88880). Assuming this 
missing sequence represented an alternatively spliced exon, DNA sequencing 
primers were designed to amplify across the predicted splice junctions. When 
the genomic sequence (Figure 9) was compared to the cDNA sequences, the 

5 exon/intron junctions were revealed (Figure 1). The human TLR4 gene is 
organized into 4 exons that span about 11 kb of genomic DNA. As predicted, 
the 120 bp sequence missing in the cDNA from Medzhitov et al. (1997) 
corresponds to exon 2. 

To determine whether other exons may be revealed by additional splice 

10 forms, a panel of cDNAs from adult (Figure 2a) and fetal (Figure 2b) tissues 
were examined with PGR primers designed to amplify all nucleotides between 
exons 1 and 4. In all adult and fetal tissues, three products were amplified (453, 
333, and 167 bp), although their relative amounts varied. In addition to the 3' 
end of exon 1 and the 5' end of exon 4, the 453 bp product contained both exons 

15 2 and 3, the 333 bp product was missing exon 2, and the 167 bp product was 
missing both exons 2 and 3 (Figure 2c). The 453 and 333 bp products are 
identical to the published cDNA sequences (Medzhitov et al., 1997; Rock et al., 
1998), and confirm that no other exons are present for the human TLR4 gene. 
The 167 bp product depicts alternative splicing of exons 1 and 4, and represents 

20 a novel TLR4 product. The nucleotide position of alleles are relative to the 
TLR4 cDNA sequence published by Medzhitov et al. (1997). 
Mutation Analysis. SSCV was employed to detect sequence variants across the 
entire coding region of TLR4 gene in the 83 unrelated probands who completed 
the incremental LPS inhalation challenge test. When band shifts were detected 

25 on SSCV analysis, the bands were sequenced to identify and confirm the 

sequence variants. The SSCV and sequence analysis were performed blinded to 
the LPS response phenotype of the study subjects. A band variant was detected 
by SSCV in 10 (12 %) of the 83 subjects, and direct sequencmg detected an A to 
G substitution at nucleotide 896 of the published TLR4 cDNA (Figure 3a) 

30 (Genbank Accession No. U88880). To confirm these findings, the 83 unrelated 
probands were sequenced in the forward and reverse directions with primers 
designed to amplify the 896 nucleotide; the same 10 individuals were found to 
have the A896G substitution and the remaining individuals were confirmed to 
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have the common TLR4 allele. Importantly, one of the 10 individuals with the 
A896G substitution was homozygous for this mutation and the remaining 9 had 
a single mutant allele. The allelic frequency of the A896G substitution was 6.6 
% in the study population, 7.9 % in a normal control population from Iowa 

5 (Lidral et al., 1 998), and 3.3 % in the parental chromosomes of the CEPH 
population (NIH-CEPH, 1992). 

The A896G substitution results in replacement of a conserved aspartic 
acid (A) residue with glycine (G) at amino acid 299 (Figure 4). This missense 
mutation (Asp299Gly) is in the fourth exon of TLR4 and is present in the 

10 extracellular domain of this receptor. The region surrounding amino acid 299 
appears to be in an alpha helical conformation (Gibrat et al., 1987). 
Replacement of the conserved aspartic acid with glycine at position 299 causes 
disruption of the alpha helical protein structure resulting in the formation of an 
extended beta strand (Gibrat et al., 1987). 

15 Two other SSCV variants were identified in a single proband; a T to C 

change at nucleotide 479 (Figure 3b) and a deletion of a thymine nucleotide at 
position -1 1 in the third intron. The T479C variant did not alter the amino acid 
composition and is therefore considered a silent mutation. The T-1 1 variant did 
not alter the acceptor splice site, so the significance of this deletion is unknown. 

20 The subject with these two sequence variants was responsive to inhaled LPS. 

An additional missense mutation was identified within exon 4 of the 
TLR4 gene. This mutation results in the replacement of a threonine with an 
isoleucine residue at amino acid 399, which is on the 3' side of the previously 
identified Asp299Gly mutation. Both mutations are present within the 

25 extracellular domain of the TLR4 protein. When the genotypes were compared 
with the phenotypes associated with endotoxin responsiveness, one individual 
who carried the Asp299Gly mutation did not carry the Thr39911e mutation. 
Likewise, one individual who had the Thr39911e mutation did not have the 
Asp299Gly mutation. Nine of the 83 volunteers carry both mutations. There 

30 was a strong correlation between both of the TLR4 mutations and endotoxin 
hyporesponsiveness. 

Phenotype/Genotype Analysis . Of the 83 unrelated study subjects who 
completed the LPS inhalation challenge test, 52 (63 %) were responsive to 
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inhaled LPS and 31 (37 %) were hyporesponsive to inhaled LPS. When the 
genotypes of these individuals were examined, the Asp299Gly sequence variant 
occurred in 3 LPS responsive (5.8 %) and 7 LPS hyporesponsive (22.6 %) study 
subjects (P=0.029). Among the subjects with the common TLR4 allele (N = 73), 
5 the dose-response slope (percent decline FEV /cumulative dose of inhaled LPS) 
averaged 1.86 % decline in FEV/jig inhaled LPS (range 0.01 %-19.78 %), while 
the dose-response slope for the subjects with the Asp299Gly allele (N = 10) was 
significantly less (P=0.037), averaging 0.59 % decline in FEV,/^g inhaled LPS 
(range 0.00 %-L59 %) (Figure 5). This group difference in the dose-response 

10 slope also exists on the log scale (P=0.026) where log (1 + A FEV,) was used 
since one individual has a A FEV, of 0.0. The subject who was homozygous for 
the Asp299Gly allele was hyporesponsive to inhaled LPS with a 0.28 % decline 
in FEV,/)ag of inhaled LPS. This homozygote Asp299Gly subject is one of a 
monozygote twin pair; her twin sister was subsequently phenotyped and was also 

15 found to be hyporesponsive to inhaled LPS with a 0.34% decline in FEV/^g of 
inhaled LPS. 

The biological significance of the Asp299Gly mutation was evaluated in 
several ways. First, transfection of CHO cells with either the WT or the mutant 
TLR4 gene demonstrated that the mutant allele does not respond normally to 

20 LPS stimulation (Figure 6a). Second, airway epithelia obtained from 

heterozygote (WT/Asp299Gly) individuals do not respond to LPS stimulation 
(Figure 6b). Third, the wild-type allele of TLR4 clearly restored LPS 
responsiveness in either airway epithelial cells (Figure 6c) or alveolar 
macrophages (Figure 6d) obtained from individuals with the TLR4 mutation. 

25 Discussion 

The results described herein provide the first direct evidence to indicate 
that a sequence polymorphism in the TLR4 gene is associated with a 
hyporesponsive LPS phenotype in humans that interrupts LPS signaling. This 
conclusion is supported by the following findings: 1) unrelated subjects with the 
30 Asp299Gly substitution were significantly less responsive to inhaled LPS than 
those homozygous for the common TLR4 allele; 2) an individual who was 
homozygous for the Asp299Gly substitution was hyporesponsive to inhaled 
LPS; 3) a monozygous twin pair, homozygous for the Asp299Gly substitution, 
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demonstrated a very similar response to inhaled LPS; and 4) /// vitro studies 
demonstrate that mutant TLR4 does not respond to LPS stimulation. This 
conclusion is indirectly supported by the finding in C3H/HeJ mice where a point 
mutation in intracellular domain of Tlr4 is also associated with LPS 
5 hyporesponsiveness (Poltorak et al., 1998; Qureshi et al., 1999). The identified 
Asp299Gly missense mutation is in the fourth exon of the TLR4 gene and, 
unlike the C3H/HeJ mutation, is located in the extracellular domain of this 
receptor. The findings described herein suggest that a specific region in the 
extracellular domain of TLR4 may play an important role in receptor function 

1 0 and regulation of the innate immune response to LPS in humans. 

This discovery may have important ramifications for a broad spectrum of 
human diseases. First, the Asp299Gly TLR4 substitution is a relatively common 
mutation, potentially affecting the responsiveness of a substantial portion of the 
population to LPS. The allelic frequency of the Asp299Gly TLR4 substitution in 

15 the three populations screened was between 3.3 % and 7.9 %, and homozygous 
individuals were identified in each of the populations. Second, the results 
described herein provide further support for the role of TLR4 in LPS signaling. 
A fundamental understanding of LPS signaling will undoubtedly alter 
approaches to gram-negative sepsis, as well as other diseases thought to be 

20 mediated by endotoxin, such as the systemic inflammatory response syndrome 
(Wang et al., 1995), acute respiratory distress syndrome (Brighan et al., 1986), 
and asthma or other forms of airway disease caused (Rylander et al., 1 989; 
Schwartz et al., 1995) or exacerbated (Schwartz et al., 1995; Michel et al., 1996) 
by endotoxin. Identification of the essential components of LPS signaling also 

25 provide new therapeutic targets for endotoxin mediated conditions. Third, the 
specific Asp299Gly TLR4 substitution may provide a simple screening 
mechanism to risk stratify a population. Identification of this mutation may lead 
to a better appreciation of the role of LPS responsiveness in a broad range of 
acquired and genetic disorders. For example, mutations of the TLR4 gene may 

30 explain why a minority of patients with gram-negative sepsis develop ARDS 
(Hudson et al., 1995) or why patients with cystic fibrosis and the identical CFTR 
mutation have different courses of disease (Veeze et al., 1994). Finally, it must 
be acknowledged that normal responsiveness to endotoxin is an important 
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component of innate immunity, and the Asp299Gly TLR4 substitution may 
prove to be associated with negative clinical outcomes. In fact, the C3H/HeJ 
mouse is more susceptible to Salmonella typhimurium (O'Brien et al., 1980), 
despite its resistance to LPS (Sultzer et al., 1968). Although individuals with the 
5 Asp299Gly tLR4 substitution may be more resistant to localized forms of 
endotoxin-induced inflammation, these individuals may prove to be more 
susceptible to a systemic inflammatory response initiated or exacerbated by 
endotoxin. 

The findings described herein demonstrate that a specific region in the 

1 0 extracellular domain of TLR4 plays an important role in receptor function and 
regulation of the innate immune response to LPS in humans. The sequence 
variants reported thus far for mammalian, i.e., murine, TLR4 are located in the 
cytoplasmic domain (Poltorak et al., 1998; Qureshi et al, 1999). While it is 
logical to hypothesize that mutations in the cytoplasmic domain of TLR4 disrupt 

1 5 the signaling pathway leading to. activation of NF-kB and AP-1 (Medhitov et al., 
1998), the potential mechanisms that are altered by mutations in extracellular 
domain of TLR4 are less obvious. 

TLR4 is a transmembrane protein and it is therefore not surprising to find 
a putative signal peptide at the N-terminus, presumably responsible for proper 

20 trafficking to the cell membrane (Medzhitov et al., 1997). This finding raises the 
possibility that sequence variants in the extracellular domain of TLR4 can 
disrupt trafficking of this receptor to the cell membrane and enhance proteolysis. 
There are several examples in the literature which show that the presence of a 
single point mutation at a conserved amino acid can disrupt protein folding and 

25 specifically affect protein trafficking (Jackson et al., 1 998; DeFranco et al., 
1998). Mutations of this kind could give rise to a range of phenotypes from 
almost normal to completely unresponsive, depending on the severity of the 
trafficking defect. Another possible mechanism that may explain the association 
between a mutation in the extracellular domain of TLR4 and LPS responsiveness 

30 is impairment of hgand binding to the extracellular domain. The extracellular 
domains of the toll proteins are clearly involved in the cellular response to LPS 
(Medzhitor et al., 1997; Yang et al., 1998). Yet, so far no convincing evidence 
has been found that the toll proteins are directly interacting with LPS (Wright, 
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1999). The discovery of an association between a mutation in the extracellular 
domain of TLR4 and a discemable LPS phenotype in humans should aid in 
clarifying whether the extracellular domain of this receptor is important for 
proper membrane localization of TLR4 and if LPS directly interacts with TLR4. 
5 A final possibility is that the missense mutation of TLR4 affects its interaction 
with either an extracellular ligand or an intracellular protein (Wright, 1999). The 
extracellular domain of TLR4, and specifically the Asp299Gly amino acid 
change may provide a key therapeutic target to modulate LPS signaling. 
TLR4 may be one of several factors that may regulate the airway 

1 0 response to inhaled LPS. Among the subjects with the Asp299Gly substitution, 
7 were hyporesponsive to LPS and 3 developed airflow obstruction during the 
LPS inhalation challenge test. None of the subjects with this mutation were in 
the most sensitive quartile of the LPS responders. Although asthmatic patients 
were included because of their increased airway responsiveness to inhaled LPS 

1 5 (Michel et al., 1989), it remains possible that the 3 subjects with the Asp299Gly 
substitution who were responsive to inhaled LPS had some type of airway 
inflammation (e.g., early viral illness) that enhanced their response to inhaled 
LPS. Moreover, since 22.6 % of the study subjects who were hyporesponsive to 
LPS had at least one copy of the Asp299Gly TLR4 allele, other genes (or 

20 possibly acquired conditions) may prove to play a role in modulating the 

biological response to LPS. For instance, TNF-a is one of the primary cytokines 
that mediates the toxic effects of LPS (Beutler et al., 1985). A polymorphism at 
-308 in the TNF-a promoter region results in higher constitutive and inducible 
levels of TNF-a (Wilson et al., 1997), and this polymorphism has been shown to 

25 result in a worse outcome in children with either meningococcal disease (Nadel 
et al., 1996) or cerebral malaria (McGuire et al., 1994). Likewise, allelic variants 
of TLR4 receptors and their ligands appear to delay the progression of disease in 
patients with HIV infection (Mummidi et al,, 1995). Thus, TLR4 appears to 
represent only one of a number of genetic variants that may modulate the 

30 pathophysiologic response to LPS. 
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Example 2 

Methods 

To genotype patients for TLR4, a PGR based RFLP assay was employed. 
For example, to detect point mutations in codon 299 such as those that result in 

5 an amino acid substitutions at amino acid 299 (Asp (GAT) to Gly (GGT) 
change) and/or in codon 399 (Thr (ACC) to He (ATC) change), a primer 
spanning that codon and a second primer 3' or 5' to that primer are prepared. 
Primers useful to detect a nucleotide change at codon 299 are: 299Forward 
primer (5' GATTAGCATACTTAGACTACTACCTCCATG 3'; SEQ ID 

1 0 NO:66) and 299Reverse primer (5 ' 

GATCAACTTCTGAAAAAGCATTCCCAC; SEQ ID NO:67). The underlined 
base in the forward primer fortuitously creates a Ncol (CCATGG) restriction site 
in the context of a mutation which is specific for carriers of the 299 mutation 
Asp to Gly. Primers useful to detect a nucleotide change at codon 399 include: 

1 5 399Forward primer (5 ' GGTTGCTGTTCTC AAAGTGATTTTGGG AGA A; 
SEQ ID NO:68) and 399Reverse Primer (5' 

ACCTGAAGACTGGAGAGTGAGTTAAATGCT; SEQ ID NO:69). The 
underlined base in the 399 Forward primer fortuitously creates a Hinfl restriction 
site (GANTC) in the context of a mutation which is specific for carriers of the 

20 399 mutation Thr to He. The reverse primers were chosen at random based 
solely on such parameters as annealing temperature, expected product size and 
the like. Additional residues or fewer residues at the 5' end of the primers 
having the nucleotide change will not affect the outcome as long as the 
amplification conditions and reverse primers are chosen accordingly. 

25 A MJ Tetrad PTC-225 Thenmo cycler was used with both primer sets. 

For the 299 assay, primers at 10-100 pmole were mixed with 20 ng of genomic 
DNA, and Perkin Elmer Taq polymerase, deoxyribonucleotides and buffer 
according to the manufacturer's directions. Cycling conditions included an 
initial denaturation of 95*'C, for 3 to 4 minutes, then 30 cycles of denaturation at 

30 95°C for 30 seconds; annealing at 55°C for 30 seconds; and extension at 72°C 
for 30 seconds. 

For the 399 assay, a Clontech Advantage amplification kit was used 
according to the manufacturer's directions. The cycling conditions included an 
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initial denaturation at 95°C for 3 to 4 minutes, then 30 cycles of denaturation at 
95°C for 30 seconds; annealing at 55*^0 for 30 seconds; and extension at 72°C 
for 30 seconds. 

R£SUilS 

5 Septic shock remains a significant health concern world-wide and despite 

progress in understanding the physiologic and molecular basis of septic shock, 
the high mortality rate of septic shock patients remains unchanged. A common 
polymorphism in TLR4 was identified that is associated with 
hyporesponsiveness to inhaled endotoxin or lipopolysaccharide (LPS) in humans 

10 (Example 1). Since TLR4 is a major receptor for LPS in mammals and Gram- 
negative bacteria are the prevalent pathogen associated with septic shock, these 
specific TLR4 alleles may be associated with a predisposition to or a more 
severe disease outcome for septic shock patients. Ninety-one septic shock 
patients as well as 83 healthy blood donor controls were genotyped for the 

1 5 presence of the TLR4 Asp299Gly and TLR4 Thr399Ile mutations. The TLR4 
Asp299Gly allele was found exclusively in septic shock patients (P = 0.05). 
Furthermore, septic shock patients with the TLR4 Asp299Gly allele had a 
significantly higher prevalence of Gram-negative infections. These findings 
suggest that the TLR4 Asp299Gly allele predisposes people to develop septic 

20 shock with Gram-negative organisms. 

In another study in a German population, the following observations 
were made: patients with TLR4 mutation(s) tended to have a longer ICU stay; 
patients with TLR4 mutation(s) were more prone to sepsis; and sepsis in 
patients with TLR4 mutation(s) was more severe than in patients that were wild- 

25 type for TLR4. 

In a third study in a Finnish population, it was observed that mothers of 
pre-term infants and pre-term infants had a higher fi-equency of TLR4 
mutation(s) than term infants. Also, infants with respiratory distress syndrome 
(RDS), particularly those bom very prematurely, tended to have a higher 

30 fi-equency of mutation(s) than premature infants without respiratory distress 
syndrome. 
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Thus, generally, the 299 mutation is slightly more prevalent than the 399 
mutation, and in 90% of the cases, both mutations were observed. When only 
one mutation was found, it w^as more frequently the 299 mutation. 

Thus, the presence of TLR4 mutations is associated with predisposition 
5 to septic shock, severity of sepsis, pre-term delivery, and RDS in pre-term 
infants. 
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the basic principles of the invention. 
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WHAT IS CLAIMED IS: 

1 . An isolated and purified DNA molecule comprising a human genomic 
DNA segment encoding TLR4, a biologically active fragment or variant 
thereof. 

2. The DNA molecule of claim 1 wherein the DNA segment encodes a 
variant TLR4. 

3. The DNA molecule of claim 2 wherein the variant TLR4 has at least one 
amino acid substitution in the extracellular domain of TLR4. 

4. The DNA molecule of claim 3 wherein the variant TLR4 comprises a 
glycine residue at position 299 or an isoleucine residue at position 399. 

5. An isolated and purified nucleic acid molecule comprising a human 
nucleic acid segment encoding a variant TLR4, or a biologically active 
fragment thereof. 

6. The nucleic acid molecule of claim 5 wherein the variant has at least one 
amino acid substitution in the extracellular domain of TLR4. 

7. The nucleic acid molecule of claim 6 wherein the variant comprises a 
glycine residue at position 299 or an isoleucine residue at position 399. 

8. An expression cassette comprising a promoter operably linked to a 
human DNA segment encoding a variant TLR4, or a biologically active 
fragment thereof. 

9. The expression cassette of claim 8 wherein the variant TLR4 has at least 
one amino acid substitution in the extracellular domain of TLR4. 
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10. The expression cassette of claim 8 wherein the variant TLR4 comprises a 
glycine residue at position 299 or an isoleucine residue at position 399. 

11. A host celK the genome of which is augmented by the DNA molecule of 
claim 1. 

1 2. A host cell, the genome of which is augmented by the nucleic acid 
molecule of claim 5. 

13. A method to prepare a variant TLR4, comprising: culturing a host cell 
transformed with the nucleic acid molecule of claim 5 operably linked to 
a promoter, so that said host cell expresses the variant TLR4. 

14. A method to prepare a variant TLR4, comprising: culturing a host cell 
transformed with the DNA molecule of claim 1 operably linked to a 
promoter, so that said host cell expresses the variant TLR4. 

15. The method of claim 13 or 14 further comprising isolating TLR4 from 
the host cell. 

16. A method of identifying a human at risk of, or having, an indication 
associated with altered innate immunity, comprising: 

a) contacting an amount of DNA obtained from a human physiological 
sample with an amount of at least one TLR4-specific oligonucleotide 
under conditions effective to amplify the DNA so as to yield amplified 
DNA; and 

b) detecting or determining whether the amplified DNA encodes at least a 
portion of a variant TLR4. 

17. The method of claim 16 wherein the nucleic acid sample comprises 
genomic DNA. 

18. The method of claim 16 wherein the nucleic acid sample is cDNA. 
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19. The method of claim 16 wherein the amplified DNA is subjected to 
electrophoresis in step b). 

20. Isolated, variant human TLR4 polypeptide, or a biologically active 
fragment thereof. 

21. The isolated polypeptide of claim 20 wherein the variant TLR4 has at 
least one amino acid substitution in the extracellular domain of TLR4. 



22. The isolated polypeptide of claim 21 wherein the variant TLR4 
comprises a glycine residue at position 299 or an isoleucine residue at 
position 399. 

23. A method to identify an agent that alters TLR4 activity, comprising: 

a) contacting a mammalian cell which expresses TLR4 with the agent; 

and 

b) detecting or determining whether the agent alters TLR4 activity. 

24. The method of claim 23 wherein the mammalian cell is transformed with 
a recombinant nucleic acid molecule encoding human TLR4. 

25. An agent identified by the method of claim 24. 

26. A method to alter TLR4 activity, comprising: contacting a mammalian 
cell with an agent effective to alter TLR4 activity. 

27. The method of claim 26 wherein the mammalian cell is a human cell. 



28. A method to detect a polymorphism in a human TLR4 gene, comprising: 
detecting or determining whether a nucleic acid sample obtained from a human 
comprises nucleic acid encoding variant TLR4. 
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29. A transgenic mouse whose cells comprise a chimeric DNA sequence, 
wherein the chimeric DNA sequence comprises a DNA segment encoding 
human TLR4, wherein the chimeric DNA sequence is integrated into the genome 
of the cells of the mouse, and wherein the DNA segment is expressed in the 
transgenic mouse so as to result in the transgenic mouse exhibiting altered innate 
immunity relative to a corresponding nontransgenic mouse. 

30. The transgenic mouse of claim 29 wherein the DNA segment encodes a 
variant human TLR4. 

3 1 . The method of claim 16 wherein the TLR4-specific oligonucleotide 
comprises at least two nucleotide substitutions relative to the wild-type TLR4 
nucleic acid sequence, which substitutions result in a restriction site that is not 
present in the wild-type TLR4 nucleic acid sequence and which restriction site is 
indicative of DNA that encodes a variant TLR4. 

32. The method of claim 31 wherein the DNA encodes a variant TLR4 
having an amino acid substitution at residue 299. 

33. The method of claim 31 wherein the oligonucleotide comprises SEQ ID 
NO:66. 

34. The method of claim 3 1 wherein the DNA encodes a variant TLR4 
having an amino acid substitution at residue 399. 

35. The method of claim 3 1 wherein the oligonucleotide comprises SEQ ID 
NO:68. 
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1 

SEQUENCE LISTING 

<110> University of Iowa Research Foundation 
5 Schwartz, David A. 

Schutte, Brian C. 

<120> Variant TLR4 nucleic acid and uses thereof 

10<130> 875.010WO1 

<150> US 09/329,515 
<151> 1999-06-10 

15<160> 72 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
20<211> 28 
<212> DNA 

<213> Homo sapiens 
<400> 1 

25gcgtggaggt atgtggctgg agtcagct 

<210> 2 
<211> 28 
<212> DNA 
30<213> Homo sapiens 

<400> 2 

tcacggaggt tagaatgctg agcacgta 

35<210> 3 
<211> 28 
<212> DNA 

<213> Homo sapiens 



40<400> 3 

ttatccaggt aatgaatcca cttttaca 
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2 

<210> 4 
<211> 21 
<212> PRT 

<213> Homo sapiens 

5 

<400> 4 

Leu Ala Tyr Leu Asp Tyr Tyr Leu Asp Asp lie He Asp Leu Phe Asn 

1 5 10 15 

Cys Leu Thr Asn Val 
10 20 

<210> 5 
<211> 20 
<212> PRT 
15<213> Mus musculus 

<400> 5 

Leu Thr Tyr Thr Asn Asp Phe Ser Asp Asp He Val Lys Phe His Cys 
1 5 10 15 

20Leu Ala Asn Val 
20 

<210> 6 
<211> 20 
25<212> PRT 

<213> Rattus norvegius 

<400> 6 

Leu Thr Tyr He Asn His Phe Ser Asp Asp He Tyr Asn Leu Asn Cys 
30 1 .5 10 15 

Leu Ala Asn. He 
20 

<210> 7 
35<211> 20 
<212> PRT 

<213> Cricetulus griseus 
<400> 7 

40Phe Thr Tyr Ala Asn Glu Phe Ser Glu Asp He Thr Asp Phe Asp Cys 
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3 

1 5 10 15 

Leu Ala Asn Val 
20 

5<210> 8 
<211> 20 
<212> DNA 

<213> Homo sapiens 

10<400> 8 
atggggcata tcagagccta 

<210> 9 
<211> 20 
15<212> DNA 
<213> Homo sapiens 

<400> 9 

gtccaatggg gaagttctct . 

20 

<210> 10 
<211> 20 
<212> DNA 

<213> Homo sapiens 

25 

<400> 10 

tcattgtcct gcagaaggtg 

<210> 11 
30<211> 20 
<212> DNA 

<213> Homo sapiens 

<400> 11 
35cagggctttt ctgagtcgtc 

<210> 12 
<211> 20 
<212> DNA 
40<213> Homo sapiens 



wo 00/77204 

<400> 12 

ctgctcggtc aaacggtgat 

<2io> 13 

5<2ll> 20 

<212> DNA 

<213> Homo sapiens 

<400> 13 
lOcagcaagcac gatattggat 

<210> 14 
<211> 20 
<212> DNA 
15<213> Homo- sapiens 

<400> 14 

gagttgggag accatgcagt 

20<210> 15 

<211> 20 

<212> DNA 

<213> Homo sapiens 

25<400> 15 

ggttcttatt cagcagaaat 

<210> 16 
<211> 20 
30<212> DNA 

<213> Homo sapiens 

<400> 16 

ggtggctgtg gagacaaatc 

35 

<210> 17 

<211> 20 

<212> DNA 

<213> Homo sapiens 

40 
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<400> 17 
acttggacct 

<210> 18 

5<211> 19 

<212> DNA 

<213> Homo 



ttccagcaac 



sapiens 



<400> 18 
lOctttatccaa ccaggtgca 



<210> 19 
<211> 23 
<212> DNA 
15<213> Homo sapiens 

<400> 19 

tgggagaatt tagaaatgaa gga 2 3 

20<210> 20 
<211> 21 
<212> DNA 

<213> Homo sapiens 
25<400> 20 

tttcttcatt ttccctggtg a 21 



<210> 21 
<211> 20 
30<212> DNA 

<213> Homo sapiens 

<400> 21 

tggacagttt cccacattga 

35 

<210> 22 

<211> 22 

<212> DNA 

<213> Homo sapiens 

40 
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<400> 22 

ttcaaaggtt gctgttctca aa 22 

<210> 23 
5<211> 23 
<212> DNA 
<213> Homo sapiens 

<400> 23 

lOtcaaacttct tgggcttaga aca 23 

<210> 24 
<211> 20 
<212> DNA 
15<213> Homo sapiens 

<400> 24 

cagagttgct ttcaatggca 20 

20<210> 25 

<211> 20 

<212> DNA 

<213> Homo sapiens 

25<400> 25 

tccaggaaaa cttccttcca 20 

<210> 26 
<211> 18 
30<212> DNA 

<213> Homo sapiens 

<400> 26 

ttcattggat acgtttcc 18 

35 

<210> 27 

<211> 20 

<212> DNA 

<213> Homo sapiens 

40 
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<400> 27 

accagagttt cctgcaatgg 

<210> 28 
5<211> 20 
<212> DNA 

<213> Homo sapiens 

<400> 28 
lOtgcctgtgct gagtttgaat 

<210> 29 
<211> 20 
<212> DNA 
15<213> Homo sapiens 

<400> 29 

cggtcctcag tgtgcttgta 

20<210> 30 
<211> 20 
<212> DNA 

<213> Homo sapiens 

25<400> 30 

ccaggatgag gactgggtaa 

<210> 31 
<211> 20 
30<212> DNA 

<213> Homo sapiens 

<400> 31 

aagccgaaag gtgattgttg 

35 

<210> 32 

<211> 23 

<212> DNA 

<213> Homo sapiens 

40 
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20 



20 



20 
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8 

<400> 32 

tatcatcttc attgtcctgc aga 

<210> 33 

5<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 33 
lOtcattgtcct gcagaaggtg 

<210> 34 
<211> 20 
<212> DNA 
15<213> Homo sapiens 

<400> 34 

gacgactcag aaaagccctg 

20<210> 35 
<211> 20 
<212> DNA 

<213> Homo sapiens 

25<400> 35 

aattggcagg aagcaacatc 

<210> 36 
<211> 21 
30<212> DNA 

<213> Homo sapiens 

<400> 36 

agtcagctcc tctgaacttt c 

35 

<210> 37 
<211> 20 
<212> DNA 

<213> Homo sapiens 

40 
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<400> 37 

cgtgctcagc attctaacct 

<210> 38 

5<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 38 
lOgaacacctca ccttgtgcag 

<210> 39 
<211> 21 
<212> DNA 
15<213> Homo sapiens 

<400> 39 

cttgatagtc cagaaaaggc t 

20<210> 40 
<211> 20 
<212> DNA 

<213> Homo sapiens 

25<400> 40 

ccgcaagtct gtgcaataaa 

<210> 41 
<211> 24 
30<212> DNA 

<213> Homo sapiens 

<400> 41 

gtcagcttat gaagcctaat ttct 

35 

<210> 42 

<211> 22 

<212> DNA 

<213> Homo sapiens 

40 
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<400> 42 

caaattgcac aggccctcta ga 

<210> 43 

5<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 43 
lOaaagtctttt accctttcaa tagtca 

<210> 44 
<211> 23 
<212> DNA 
15<213> Homo sapiens 

<400> 44 

agagatttga gtttcaatgt ggg 

20<210> 45 

<211> 20 

<212> DNA 

<213> Homo sapiens 

25<400> 45 

ctttaggctg gttgtcccaa 

<210> 46 
<211> 24 
30<212> DNA 

<213> Homo sapiens 

<400> 46 

tgaaaactca ctcatttgtt tcaa 

35 

<210> 47 

<211> 20 

<212> DNA 

<213> Homo sapiens 

40 
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<400> 47 

cttcgagact ggacaagcca 

<210> 48 

5<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 48 
lOgagaggtcca ggaaggtcaa 

<210> 49 
<211> 20 
<212> DNA 
15<213> Homo sapiens 

<400> 49 

acctggaggg agttcagaca 

20<210> 50 

<211> 20 

<212> DNA 

<213> Homo sapiens 

25<400> 50 

aagagctgcc tctggtcctt 

<210> 51 
<211> 20 
30<212> DNA 

<213> Homo sapiens 

<40b> 51 

tacaagcaca ctgaggaccg 

35 

<210> 52 
<211> 20 
<212> DNA 

<213> Homo sapiens 

40 
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20 



20 



20 



20 



20 
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<400> 52 

tttatgcagc cagcaagaag 

<210> 53 

5<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 53 
lOggaggcaccc cttcttctaa 

<210> 54 
<211> 20 
<212> DNA 
15<213> Homo sapiens 

<400> 54 

gcggctctgg atgaagtgct 

20<210> 55 

<211> 20 

<212> DNA 

<213> Homo sapiens 

25<400> 55 

ctgagcaggg tcttctccac 

<210> 56 
<211> 20 
30<212> DNA 

<213> Homo sapiens 

<400> 56 

agatgttgct tcctgccaat 

35 

<210> 57 
<211> 20 
<212> DNA 

<213> Homo sapiens 

40 
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<400> 57 

cagggctttt ctgagtcgtc 

<210> 58 
5<211> 20 
<212> DNA 

<213> Homo sapiens 



<400> 58 

20 



lOtgaacaagtg ttggacccag 

<210> 59 
<211> 21 
<212> DNA 
15<213> Homo sapiens 



<400> 59 

gattagcagc cctgcatatc t 

20<210> 60 
<211> 20 
<212> DNA 

<213> Homo sapiens 

25<400> 60 

gctcacagaa gcagtgagga 

<210> 61 
<211> 20 
30<212> DNA 

<213> Homo sapiens 

<400> 61 

taggctctga tatgccccat 

35 

<210> 62 
<211> 10665 
<212> DNA 

<213> Homo sapiens 

40 
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<220> 

<221> misc_feature 
<222> {!)... (10665) 
<223> n = A,T,C or G 

5 



<400> 62 



aaaatactcc 


cttgcctcaa 


aaactgctcg 


gtcaaacggt 


gatagcaaac 


cacgcattca 


60 


cagggccact 


gctgctcaca 


aaaccagtga 


ggatgatgcc 


aggatgatgt 


ctgcctcgcg 


120 


cctggctggg 


actctgatcc 


cagccatggc 


cttcctctcc 


tgcgtgagac 


cagaaagctg 


180 


lOggagccctgc gtggaggtat gtggctggag 


tcagctcctc 


tgaactttcc 


ctcacttctg 


240 


cccagaactt 


ctcactgtgt 


gccctggttt 


gtttattttt 


gcaaaaaaaa 


aaagagttaa 


300 


attaccttaa 


agactcaaga 


agccacagag 


atcaaataat 


tcattgttac 


agggcactag 


360 


aggcagccat 


tgggggtttg 


ttccatttgg 


aaattttgag 


tgctaacagg 


ggcatgagat 


420 


aacatagatc 


tgcttaaggt 


ccctgctctg 


ctaccttgtg 


gctctgtgaa 


gaaattatca 


480 


ISaacctgtctg agactagttt 


tcgcatctgt 


aagagaatta 


taataccttc 


ttcactagag 


540 


agtaagcaga 


ctgcttcagt 


gtcatttctt 


cccactggtg 


gtctttacac 


tcagcttcaa 


600 


gcagtcaccc 


tgctcctttc 


aatctcagga 


aaaagatggc 


tttgtgtgtg 


tgtctctaga 


660 


gaaagaactt 


tctaagttgg 


tgcagacttc 


tgtatgcagt 


aatatagttt 


agtccagagg 


720 


atgaaaaaaa 


taagagaatg 


aaaaaggaaa 


agagagagag 


agagaagaaa 


aaagcaagag 


780 


20ggaaatatgt 


ataatgtcag 


ctaatgcaac 


agtttctttc 


ttagtgaaat 


accaatcagc 


840 


tggttggtaa 


tcttattcat 


gatggatctc 


ttttgttttt 


cccctgcgca 


gacttcacag 


900 


ttgctttaga 


aacccatagt 


agagccgaac 


agctaagaaa 


atgatttaca 


gtgaggcagg 


960 


gtcagaaact 


caagagagaa 


aaagccagct 


gcagtcctga 


agttgaggat 


ataggagaaa 


1020 


atcaagtaat 


atttagcaaa 


gactaattca 


ttatcttgaa 


gccatccctt 


ccctcaattc 


1080 


25cctgcccata gtcctcctcc 


ttgtcctctt 


ctctgnatcc 


ctctgctgtt 


aggttaatgg 


1140 


agatagattt 


tctaattang 


ctcactgcga 


gataaaaccc 


agcccatgtt 


tctattagnc 


1200 


aatattgtct 


ttgaggctcc 


atggcttgca 


ncatttaagc 


agacatacga 


atgaagatct 


1260 


gcatgtttga 


actctgactt 


tgcgcatatt 


acttcatttc 


tttgaatttc 


cattttcctc 


1320 


atctttaaat 


gcttatttga 


agattaagtg 


aaagtatata 


acaaacaaga 


actatgcagg 


1380 


30cgtatggtaa 


gggattaatg 


atagatgata 


ataattaatg 


ttgacatcta 


ttgatcactt 


1440 


atactgtagc 


gggcttttaa 


ataaactctt 


taaacacctt 


atctcattta 


atccttcaaa 


1500 


cattctattg 


gtttcaaaca 


acagaaaact 


acaattagct 


ggcttctgca 


aggaattttg 


1560 


ttggaggaaa 


tgagagcatt 


cagaaattag 


atgggagcgt 


tagagaatta 


ggcttacaaa 


1620 


gaatgtggga 


aagtaggcta 


gaaagcagtg 


taaaaacaaa 


gacagcataa 


agcacttgac 


1680 


35cttatttact 


aggttccacc 


atgggaatcc 


atgcactcta 


aagatttccc 


cctatttcta 


1740 


catcactttg 


ctcaagggtc 


aatgagccaa 


ggaaaagaat 


gcagttgtca 


aaatctgggc 


1800 


catgactaag 


gaaggtctgg 


acatcttgac 


tgccagacag 


tctccccaat 


gatatggagt 


1860 


atttagaatg 


atactggata 


ttttatttat 


tttttgtatt 


ttcaactttt 


aagttcagag 


1920 


gcacatgtgc 


agagcatgca 


ggtttattac 


ataagtaaat 


gtgtgccatg 


gtgatttgct 


1980 


40gcatagatca 


tgaaaatatg 


gaacgcatca 


tggatttgtg 


tgtcatcctt 


gtgcaggggc 


2040 



wo 00/77204 PCTAJSOO/15723 

15 



catgctcatc 


ttctctgtat 


ccttccaatt 


ttagtatatg 


tgctactgca 


gcaagcacga 


2100 


tattggatat 


tttattacct 


acattttaca 


tatgataaaa 


tgaggctcac 


tgaggttttt 


2160 


cttttgttcg 


ttttattttg 


ttttgttttt 


aaagacttgg 


ccctaaacca 


cacagaagag 


2220 


ctggcatgaa 


acccagagct 


ttcagactcc 


ggagcctcag 


cccttcaccc 


cgattccatt 


2280 


Sgcttcttgct 


aaatgctgcc 


gttttatcnc 


ggaggttaga 


atgctgagca 


cgtagtaggt 


2340 


gctctttact 


ttctaatcta 


gagtaagaca 


atttataagc 


atgaattgag 


tgaatggatg 


2400 


gatggatata 


tggatggaag 


gatggacaga 


tggatgaaag gttgactgaa 


ttttgtgctt 


2460 


gcacaaaaag 


aggcccctct 


ccaccatctc 


tggtctagga 


gaggggagtt 


gggagaccat 


2520 


gcagtaaaga 


tacttcatgt 


catgtgtaat 


cattgcaggt 


ggttcctaat 


attacttatc 


2580 


lOaatgcatgga gctgaatttc 


tacaaaatcc 


ccgacaacct 


ccccttctca 


accaagaacc 


2640 


tggacctgag 


ctttaatccc 


ctgaggcatt 


taggcagcta 


tagcttcttc 


agtttcccag 


2700 


aactgcaggt 


gctggattta 


tccaggtaat 


gaatccactt 


ttacatactg 


cacaaggtga 


2760 


ggtgttcatt 


gtcctatcat 


ttcattattg 


gactggaaag 


cttggtttgt 


ggagtctcat 


2820 


cttcattcac 


ttattcattc 


atacaacaga 


tgtcttatta 


actatataac 


cttgagcaag 


2880 


ISctacctctat 


tctccaggtc 


tcagttttct 


aatctgtgaa 


gtaggcagtt 


ggctgagaca 


2940 


gcttctaagg 


gcaattctaa 


ttttaggttt 


tcttttaaga 


caggagagaa 


aattagctta 


3000 


aattctttca 


taagcagcta 


tttattgact 


acttgctata 


tgttgtacac 


tctgcaagaa 


3060 


gacaggcata 


tattgatata 


taacacacag 


cccctgttgt 


taaggaggca 


tatcttcttg 


3120 


aaagagttaa 


taccttaaag 


tcctgggtat 


ggtcctgggt 


acatagtata 


tagtcaacac 


3180 


20attttaatta 


tgattttttg 


gatctggaaa 


ctgatataaa 


gatagcgaca 


tataacagta 


3240 


ggtgataaat 


tatgtttaaa 


ctaaaggtaa 


ctaattgtat 


ttttcagaag 


aggggccttc 


3300 


tctgtggtgg 


gtagtcaaga 


aagattcatg 


aactgcataa 


gattcaaaca 


atgtctagaa 


3360 


tattaaaact 


agtggtggca 


ggtgaaatgt 


catcttgata 


ttttagggga 


accaaattct 


3420 


aaaagggttt 


tcatcatcgg ggccttattt 


gcaaatcgaa 


ctagataatg 


gatcatgttc 


3480 


25tctgcaatgg tttgtaaaac 


atttcaaaac 


attttacata 


ttttttatta 


tagaaattat 


3540 


tgataaagac 


taaggtcaca 


gtataaaaat 


cctttttaga 


gcagacattt 


ctgtagaaga 


3600 


gtgaacatat 


gacctattat 


actctaattt 


ggatatagat 


aggatgtaac 


aaaggagtaa 


3660 


tgggaacaat 


tcaaaggcag 


tggtatagtg 


catanagtcc 


tgttggggtc 


agaagacctg 


3720 


agcccaagtt 


tacccccaac 


atttataacc 


catgtaacct 


tagcatatta 


cttcatctcc 


3780 


30cttaatcctt 


agtttcatat 


ctgatcaatg 


gaaatgatga 


aacttattct 


gctggattaa 


3840 


atgtgataat 


aaatattaat 


atgctgtata 


tatttaaatt 


tttataaaat 


atattttata 


3900 


agcataaagt 


attcttacag 


aatttcatta 


ggtttttaaa 


ataatttcaa 


cttttatttt 


3960 


tgattcaggg 


atttacatgg 


ttatattgcg 


taatgctgag 


gtgtagggta 


caatcgatac 


4020 


catcactcag 


gtagtgagca 


tagtacccaa 


tagttagttt 


ttcaaccctt 


gctgctttct 


4080 


35ctctatcccc 


tctctagtaa 


tccccagggt 


ctatttttgt 


catctttatg 


tccatgtgta 


4140 


ctccatgttt 


ggatcctact 


tataaagtga 


gaactcatgg 


tatttggctt 


tctgtncctt 


4200 


tgttngctaa 


tttgcttagg 


ataatggcta 


ctagctgcat 


ctatgccatt 


atgttctaaa 


4260 


tttcanttnc 


ctgcatgaaa 


attttgtcaa 


gtactctatt 


aaggtagacc 


acctctccct 


4320 


ttttttttca 


aacaagaagt 


agnttttccc 


aaacaatgcc 


cttatggaat 


tnatcttcaa 


4380 


40tccnnggata 


cccaataact 


tgccccaaan 


ccttaatctg 


ncttacagag 


aggccacctt 


4440 
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ccttctgtaa 


cccataggag 


atttggattg 


gtaagaatgc 


tttgtgatag 


cccagcagcc 


4500 


ttctttcccc 


tatagaaata 


tatatatant 


ctttttatag 


gtgaggaact 


gaagcttgaa 


4560 


taatttaaat 


gacttatata 


catnatcatt 


gcttgttagc 


cacagaccag 


agacttaagt 


4 62 0 


tcncatctcc 


agaatccaac 


ttaaatgttt 


tctttgtctt 


aatactctac 


ttccctaaag 


4680 


Stgattatcac 


caatgtaatg 


atatagagnc 


acagcaagac 


cctttccttc 


tcacctaatg 


474 0 


tatagagcaa 


tgcagagata gaatgatggg 


ctataacaat 


catataattg 


aaagaaagaa 


4800 


cttcaaaaat 


aatcaagttc 


agctgtttga 


tttataaatg 


tgataactaa 


aacctagaga 


4860 


ggaaaagagg 


tactcaagat 


cacacagtag gagaggactg 


cagaaacacc 


aaacccaagc 


4920 


tcttttgtcc 


actcttccag 


cgttctttct 


actatactgc 


ctatccttta 


tctagttacc 


4980 


lOaataaataac 


aaaagcttgg 


accacaatgc 


ttttattgtc 


taggaaactc 


ctgaagaagc 


5040 


taaataaaat 


gggtggggaa 


tattgtaaat 


gtaattcagg 


ctggattaag 


aaagaactta 


5100 


tttgacattg 


taactgacaa 


gcacctgcaa 


tgctgaaagg 


aatttttcat 


tggcntgctg 


5160 


tttgctgggc 


tgcatcaaag 


ccctgtctct 


aggacatgtc 


tctgaacatt 


gtgtgtagca 


5220 


tggctttcat 


ttcttttagg 


ataaaattca 


aaacccttta 


tctggttggt 


aaacctctgc 


52 80 


ISctaattggga 


accttctttc 


tccacaactc 


catattgtac 


actccaattt 


catctctgtt 


5340 


ctccaaccat 


ggaagctatt 


tgtcatgatt 


cctccttgtg 


tcattttttt 


tctgtcaacc 


5400 


ttggggcttt 


tgtgtttgct 


gttcacttca 


cctcctttta 


ttgttaactt 


ctactcatct 


5460 


ttcaattttc 


aacttaagtg 


ttctcagaga 


aacctacttt 


gattttcttg 


gtccanaacg 


5520 


gttctctgga 


tgtgaactct 


tatagcacat 


aattttcact 


tttttccaca 


aaactcgctc 


5580 


20ctatcacctg 


ttacaagcat 


ttacctctga 


taacaagaac 


tttcaaatat 


ctagctgtca 


5640 


tgtaagcact 


tttcataaac 


attaagagta 


tctgtgacac 


ttatgtgtaa 


tgtttcgtat 


5700 


ctctgaaatt 


gatatttacc 


agtcatttat 


cttggctacc 


aactaacaac 


tatccatatt 


5760 


atctgtacca 


atcagatgta 


taatcacaat 


tttgtgtgac 


agaaaatggc 


taaacttgat 


5820 


ccaaggctat 


tacatgcttt 


atcaactgca 


caatctttat 


atatgtcaat 


tattgatctt 


5880 


25taactgattt 


ccttcttatg 


gattttctcc 


tctgcttatc 


atgtatgcct 


aacatgacaa 


5940 


aaaagagcct 


atcattgcag 


ccagtatgat 


aatactcagt 


ctgtggggct 


tcttatttgc 


6000 


ttattccatc 


atcatctgtc 


ctgcttgatg 


tctttgccta 


tgcacaatca 


tatgacccat 


6060 


cacatctgta 


tgaagagctg gatgactagg 


attaatattc 


tattttaggt 


tcttattcag 


6120 


cagaaatatt 


agataatcaa 


tgtcttttta 


ttcctgtagg 


tgtgaaatcc 


agacaattga 


6180 


30agatggggca 


tatcagagcc 


taagccacct 


ctctacctta 


atattgacag 


gaaaccccat 


6240 


ccagagttta 


gccctgggag 


ccttttctgg 


actatcaagt 


ttacagaagc 


tggtggctgt 


6300 


ggagacaaat 


ctagcatctc 


tagagaactt 


ccccattgga 


catctcaaaa 


ctttgaaaga 


6360 


acttaatgtg 


gctcacaatc 


ttatccaatc 


tttcaaatta 


cctgagtatt 


tttctaatct 


6420 


gaccaatcta 


gagcacttgg 


acctttccag 


caacaagatt 


caaagtattt 


attgcacaga 


6480 


35cttgcgggtt 


ctacatcaaa 


tgcccctact 


caatctctct 


ttagacctgt 


ccctgaaccc 


6540 


tatgaacttt 


atccaaccag 


gtgcatttaa 


agaaattagg 


cttcataagc 


tgactttaag 


6600 


aaataatttt 


gatagtttaa 


atgtaatgaa 


aacttgtatt 


caaggtctgg 


ctggtttaga 


6660 


agtccatcgt 


ttggttctgg 


gagaatttag 


aaatgaagga 


aacttggaaa 


agtttgacaa 


6720 


atctgctcta 


gagggcctgt 


gcaatttgac 


cattgaagaa 


ttcccgatta 


gcatacttag 


6780 


40actactacct 


cgatgatatt 


attgacttat 


ttaattggtt 


gacaaatggt 


tcttcatttt 


6840 
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gaaagggtaa 


aagacttttc 


ttataatttc 


gqatggcaac 


6900 


atttagaatt 


agttaactgt 


aaatttggac 


agtttcccac 


attgaaactc 


aaatctctca 


6960 


aaaggcttac 


tttcacttcc 


aacaaaggtg ggaatgcttt 


ttcagaagtt 


gatctaccaa 


7020 


gccttgagtt 


tctagatctc 


agtagaaatg 


gcttgagttt 


caaaggttgc 


tgt^ctcaaa 


7080 


Sgtgattttgg gacaaccagc ctaaagtatt tagatctgag cttcaatggt 


gtCattacca 


7140 


tgagttcaaa 


cttcttgggc 


ttagaacaac 


tagaacatct 


ggatttccag 


cattccaatt 


7200 


tgaaacaaat 


gagtgagttt 


tcagtattcc 


tatcactcag 


aaacctcatt 


taccttgaca 


7260 


tttctcatac 


tcacaccaga 


gttgctttca 


atggcatctt 


caatggcttg 


tccagtctcg 


7320 


aagtcttgaa 


aatggctggc 


aattctttcc 


aggaaaactt 


ccttccagat 


atcttcacag 


7380 


lOagctgagaaa -cttgaccttc ctggacctct ctcagtgtca actggagcag 


ttgtcCccaa 


7440 


cagcatttaa 


ctcactctcc 


agtcttcagg 


tactaaatat 


gagccacaac 


aac ttct ttt 


7500 


cattggatac 


gtttccttat 


aagtgtctga 


actccctcca 


ggttcttgat 


tacagtc tea 


7560 


atcacataat 


gacttccaaa 


aaacaggaac 


tacagcattt 


tccaagtagt 


ctagctttct 


7620 


taaatcttac 


tcagaatgac 


tttgcttgta 


cttgtgaaca 


ccagagtttc 


ctgcaatgga 


7680 


IStcaaggacca 


gaggcagctc 


ttggtggaag 


ttgaacgaat 


ggaatgtgca 


acacct tcag 


7740 


ataagcaggg 


catgcctgtg 


ctgagtttga 


atatcacctg 


tcagatgaat 


aagaccatca 


7800 


ttggtgtgtc 


ggtcctcagt 


gtgcttgtag 


tatctgttgt 


agcagttctg 


gtctataagt 


7860 


tctattttca 


cctgatgctt 


cttgctggct 


gcataaagta 


tggtagaggt 


gaaaacatcti 


7920 


atgatgcctt 


tgttatctac 


tcaagccagg atgaggactg ggtaaggaat 


gagctagtaa 


7980 


20agaatttaga 


agaaggggtg 


cctccatttc 


agctctgcct 


tcactacaga 


gactttattc 


8040 


ccggtgtggc 


cattgctgcc 


aacatcatcc 


atgaaggttt 


ccataaaagc 


cqaaagqtga 


8100 


ttgttgtggt 


gtcccagcac 


ttcatccaga 


gccgctggtg 


tatctttgaa 


tatgagattg 


8160 


ctcagacctg gcagtttctg 


agcagtcgtg 


ctggtatcat 


cttcattgtc 


ctgcagaagg 


8220 


tggagaagac 


cctgctcagg 


cagcaggtgg 


agctgtaccg 


ccttctcagc 


aggaacactt 


8280 


25acctggagtg ggaggacagt gtcctggggc ggcacatctt ctggagacga 


ctcagaaaag 


8340 


ccctgctgga 


tggtaaatca 


tggaatccag 


aaggaacagt 


gggtacagga 


tgcaattggc 


8400 


aggaagcaac 


atctatctga 


agaggaaaaa 


taaaaacctc 


ctgaggcatt 


tcttgcccag 


8460 


ctgggtccaa 


cacttgttca 


gttaataagt 


attaaatgct 


gccacatgtc 


aggccttatg 


8520 


ctaagggtga 


gtaattccat 


ggtgcactag 


atatgcaggg 


ctgctaatct 


caaggagctt 


8580 


30ccagtgcaga gggaataaat gctagactaa aatacagagt cttccaggtg 


ggcatttcaa 


8640 


ccaactcagt 


caaggaaccc 


atgacaaaga 


aagtcatttc 


aactcttacc 


tcatcaagtt 


8700 


gaataaagac 


agagaaaaca 


gaaagagaca 


ttgttctttt 


cctgagtctt 


ttgaatggaa 


8760 


attgtattat 


gttatagcca 


tcataaaacc 


attttggtag 


ttttgactga 


actgggtgtt 


8820 


cactttttcc 


tttttgattg 


aatacaattt 


aaattctact 


tgatgactgc 


agtcgtcaag 


8880 


35gggctcctga 


tgcaagatgc 


cccttccatt 


ttaagtctgt 


ctccttacag 


aggt taaagt 


8940 


ctagtggcta 


attcctaagg 


aaacctgatt 


aacacatgct 


cacaaccatc 


ctggtcattc 


9000 


tcgagcatgt 


tctatttttt 


aactaatcac 


ccctgatata 


tttttatttt 




9060 


gttttcattt 


ttttacgtct 


tgcctataag 


ctaatatcat 


aaataaggtt 


gtttaagacg 


9120 


tgcttcaaat 


atccatatta 


accactattt 


ttcaaggaag 


tatggaaaag 


tacactctgt 


9180 


40cactttgtca 


i ctcgatgtca 


ttccaaagtt 


attgcctact 


aagtaatgac 


tgtcatgaaa 


9240 
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gcagcattga 


aataatttgt 


ttaaaggggg 


cactctttta 


aacgggaaga 


aaatttccgc 


9300 


ttcctggtct 


tatcatggac 


aatttgggct 


ataggcatga 


aggaagtggg 


attacctcag 


9360- 


gaagtcacct 


tttcttgatt 


ccagaaacat 


atgggctgat 


aaacccgggg 


tgacctcatg 


9420 


aaatgagttg 


cagcagatgt 


ttattttttt 


cagaacaagt 


gatgtttgat 


ggacctatga 


9480 


5atctatttag 


ggagacacag 


atggctggga 


tccctcccct 


gtacccttct 


cactgccagg 


9540 


agaactacgt 


gtgaaggtat 


tcaaggcagg 


gagtatacat 


tgctgtttcc 


tgttgggcaa 


9600 


tgctccttga 


ccacattttg 


ggaagagtgg 


atgttatcat 


tgagaaaaca 


atgtgtctgg 


9660 


aattaatggg gttcttataa 


agaaggttcc 


cagaaaagaa 


tgttcattcc 


agcttcttca 


9720 


ggaaacagga 


acattcaagg 


aaaaggacaa 


tcaggatgtc 


atcagggaaa 


tgaaaataaa 


9780 


lOaaccacaatg 


agatatcacc 


ttataccagg 


tagatggcta 


ctataaaaaa 


atgaagtgtc 


9840 


atcaaggata 


tagagaaatt 


ggaacccttc 


ttcactgctg 


gagggaatgg 


aaaatggtgt 


9900 


agccgttatg 


aaaaacagta 


cggaggtttc 


tcaaaaatta 


aaaatagaac 


tgctatatga 


9960 


tccagcaatc 


tcacttctgt 


atatataccc 


aaaataattg 


aaatcagaat 


ttcaagaaaa 


10020 


tatttacact 


cccatgttca 


ttgtggcact 


cttcacaatc 


actgtttcca 


aagttatgga 


10080 


ISaacaacccaa 


atttccattg 


gaaaataaat 


ggacaaagga 


aatgtgcata 


taacgtacaa 


10140 


tggggatatt 


attcagccta 


aaaaaagggg 


ggatcctgtt 


atttatgaca 


acatgaataa 


10200 


acccggaggc 


cattatgcta 


tgtaaaatga 


gcaagtaaca 


gaaagacaaa 


tactgcctga 


10260 


tttcatttat 


atgaggttct 


aaaatagtca 


aactcataga 


agcagagaat 


agaacagtgg 


10320 


ttcctaggga 


aaaggaggaa 


gggagaaatg 


aggaaatagg 


gagttgtcta 


attggtataa 


10380 


20aattatagta 


tgcaagatga 


attagctcta 


aagatcagct 


gtatagcaga 


gttcgtataa 


10440 


tgaacaatac 


tgtattatgc 


acttaacatt 


ttgttaagag 


ggtacctctc 


atgttaagtg 


10500 


ttcttaccat 


atacatatac 


acaaggaagc 


ttttggaggt 


gatggatata 


tttattacct 


10560 


tgattgtggt 


gatggtttga 


caggtatgtg 


actatgtcta 


aactcatcaa 


attgtataca 


10620 


ttaaatatat 


gcagttttat 


aatatcaaaa 


aaaaaaaaaa 


aaaaa 




10665 
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<210> 63 
<211> 28 
30<212> DNA 

<213> Homo sapiens 

<400> 63 

attttgtttt gtttttaaag acttggcc 

35 

<210> 64 
<211> 28 
<212> DNA 

<213> Homo sapiens 

40 
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<400> 64 

gtcatgtgta atcattgcag gtggttcc 

<210> 65 
5<211> 28 
<212> DNA 

<213> Homo sapiens 
<400> 65 

lOatgtcttttt attcctgtag gtgtgaaa 

<210> 66 
<211> 30 
<212> DNA 
15<213> Artificial Sequence 

<220> 

<223> A primer 

20<400> 66 

gattagcata cttagactac tacctccatg 

<210> 67 
<211> 27 
25<212> DNA 

<213> Homo sapiens 



<400> 67 

gatcaacttc tgaaaaagca ttcccac 

30 

<210> 68 
<211> 31 
<212> DNA 

<213> Artificial Sequence 

35 

<220> 

<223> A primer 
<400> 68 

40ggttgctgtt ctcaaagtga ttttgggaga a 



27 



31 
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<210> 69 

<211> 30 

<212> DNA 

<213> Homo sapiens 

5 

<400> 69 

acctgaagac tggagagtga gttaaatgct 

<210> 70 
10<211> 1360 
<212> DNA 

<213> Homo sapiens 
<220> 

15<221> misc_feature 
<222> (1) . . . (1360) 
<223> n = A,T,C or G 

<400> 70 

20ttccacttct aagagctgcc tagagtagtc 
ttcaagggcc tagggaaagg ggaaatgggg 
tgtacaaaaa tatgaacata attaatgcca 
aataaatttt atgttatgtc atgttatgtt 
aaaagaaaag aaagaaagta aagcttagcg 

25catgagtcga cccagctgag atctganctt 
ttcccaatct tcttgtactt ttcaaactgt 
atgtgagttt cttcgcaaga agggcgggcc 
aagtcctaac ccctctacct cagactatga 
tatgtaaggt agaatgaggt cattatggtg 

30taagaagggg agattagaat tcagacacac 
ctctagaatt gtgagaaaat gaatgtctgt 
ttatggcagc cccagcaaac taatacaaat 
aatcaccgtc atcctagaga gttacaaggg 
aattcaccaa gcccaggcag aggtcagatg 

35tcttgctgtt tctttagcca ctggtctgca 
tgacaaaaga gataactatt agagaaacaa 
cttcctctca ccctttagcc cagaactgct 
gaggaagaga agacaccagt gcctcagaaa 
gcattcacag ggccactgct gctcacagaa 

40cctcgcgcct ggctgggact ctgatcccag 



aagattatag 


agacaaaagt 


agtgcataga 


60 


agttatttat 


taatgaatag 


tggtgatgat 


120 


ctaaattgtn 


cacatacaaa 


tggtcaagat 


180 


atgtgatttt 


accataatac 


agaaaatgaa 


240 


gtttncatga 


cttgnccaat 


gcctcaaagc 


300 


cagtatattc 


cattctgaaa 


tcccagactt 


360 


gtttcagttg 


aggtttattt 


tcagttttgt 


420 


aaattgtgtc 


ctgcaaaaac 


ctacatatcg 


480 


ctgtatatgg 


agagagagcc 


ttgaaagagg 


540 


ggccctaatc 


caacataact 


ggtgtcctta 


600 


ttgctgacac 


cttgagttca 


gactggaagc 


660 


tgtttaagcc 


acccagtctg 


tggtatttcc 


720 


agtgtttcca 


cagctgaaac 


aaaattggaa 


780 


ctattttaat 


agaacctgat 


tgttttccta 


840 


actaattggg 


ataaaagcca 


actagcttcc 


900 


ggcgttttct 


tcttctaact 


tcctctcctg 


960 


aagtccagaa 


tgctaaggtt 


gccgctttca 


1020 


ttgaatacac 


caattgctgt 


ggggcggctc 


1080 


ctgctcggtc 


agacggtgat 


agcgagccac 


1140 


gcagtgagga 


tgatgccagg 


atgatgtctg 


1200 


ccatggcctt 


cctctcctgc 


gtgagaccag 


1260 
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aaagctggga gccctgcgtg gaggtatgtg gctggagtca gctcctctga actttccctc 1320 
acttctgccc agaacttctc actgtgtgcc ctggtttgtt 1360 

<210> 71 

5<211> 1333 

<212> DNA 

<213> Homo sapiens 



<400> 71 



lOcgcatcatgg atttgtgtgt catccttgtg 


caggggccat 


gctcatcttc 


tctgtatcct 


60 


tccaatttta 


gtatatgtgc 


tactgcagca 


agcacgatat 


tggatatttt 


attacctaca 


120 


ttttacatat 


gataaaatga 


ggctcactga 


ggtttttctt 


ttgttcgttt 


tattttgttt 


180 


tgtttttaaa 


gacttggccc 


taaaccacac 


agaagagctg 


gcatgaaacc 


cagagctttc 


240 


agactccgga 


gcctcagccc 


ttcaccccga 


ttccattgct 


tcttgctaaa 


tgctgccgtt 


300 


15ttatcacgga ggttagaatg ctgagcacgt 


agtaggtgct 


ctttactttc 


taatctagag 


360 


taagacaatt 


tataagcatg 


aattgagtga 


atggatggat 


ggatatatgg 


atggaaggat 


420 


ggacagatgg 


atgaaaggtt 


gactgaattt 


tgtgcttgca 


caaaaagagg 


cccctctcca 


480 


ccatctctgg 


tctaggagag 


gggagttggg 


agaccatgca 


gtaaagatac 


ttcatgtcat 


540 


gtgtaatcat 


tgcaggtggt 


tcctaatatt 


acttatcaat 


gcatggagct 


gaatttctac 


600 


20aaaatccccg 


acaacctccc 


cttctcaacc 


aagaacctgg 


acctgagctt 


taatcccctg 


660 


aggcatttag 


gcagctatag 


cttcttcagt 


ttcccagaac 


tgcaggtgct 


ggatttatcc 


720 


aggtaatgaa 


tccactttta 


catactgcac 


aaggtgaggt 


gttcattgtc 


ctatcatttc 


780 


attattggac 


tggaaagctt 


ggtttgtgga 


gtctcatctt 


cattcactta 


ttcattcata 


840 


caacagatgt 


cttattaact 


atataacctt 


gagcaagcta 


cctctattct 


ccaggtctca 


900 


25gttttctaat 


ctgtgaagta 


ggcagttggc 


tgagacagct 


tctaagggca 


attctaattt 


960 


taggttttct 


tttaagacag 


gagagaaaat 


tagcttaaat 


tctttcataa 


gcagctattt 


1020 


attgactact 


tgctatatgt 


tgtacactct 


gcaagaagac 


aggcatatat 


tgatatataa 


1080 


cacacagccc 


ctgttgttaa 


ggaggcatat 


cttcttgaaa 


gagttaatac 


cttaaagtcc 


1140 


tgggtatggt 


cctgggtaca 


tagtatatag 


tcaacacatt 


ttaattatga 


ttttttggat 


1200 


30ctggaaactg atataaagat 


agcgacatat 


aacagtaggt 


gataaattat 


gtttaaacta 


1260 


aaggtaacta 


attgtatttt 


tcagaagagg 


ggccttctct 


gtggtgggta 


gtcaagaaag 


1320 


attcatgaac 


tgc 










1333 



<210> 72 
35<211> 6786 
<212> DNA 
<213> Homo sapiens 

<220> 

40<221> misc feature 
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<222> (1) . . . (6786) 
<223> n = A,T,C or G 

<400> 72 

Sggtaagaatg ctttgtgata gcccagcagc 
tctttttata ggtgaggaac tgaagcttga 
tgcttgttag ccacagacca gagatttaag 
ttctttgtct taatactcta cttctctaaa 
cacagcaaga ccctttcctt ctcacctaat 

lOgctataacaa tcatataatt gaaagaaaga 
atttataaat gtgataacta aaacctagag 
ggagaggact gcagaaacac caaacccaag 
tactatactg cctatccttt atctagttac 
cttttattgt ctaggaaact cctgaagaag 

IStgtaattcag gctggattaa gaaagaactt 
atgctgaaag gaatttttca ttggcntgct 
taggacatgt ctctgaacat tgtgtgtagc 
aaaacccttt atctggttgg taaacctctg 
ccatattgta cactccaatt tcatctctgt 

20tcctccttgt gtcatttttt ttctgtcaac 
acctcctttt attgttaact tctactcatc 
aaacctactt tgattttctt ggtccanaac 
taattttcac ttttttccac aaaactcgct 
ataacaagaa ctttcaaata tctagctgtc 

25atctgtgaca cttatgtgta atgtttcgta 
tcttggctac caactaacaa ctatccatat 
ttttgtgtga cagaaaatgg ctaaacttga 
acaatcttta tatatgtcaa ttattgatct 
ctctgcttat catgtatgcc taacatgaca 

30taatactcag tctgtggggc .ttcttatttg 
gtctttgcct atgcacaatc atatgaccca 
gattaatatt ctattttagg ttcttattca 
attcctgtag gtgtgaaatc cagacaattg 
. tctctacctt aatattgaca ggaaacccca 

35gactatcaag tttacagaag ctggtggctg 
tccccattgg acatctcaaa actttgaaag 
ctttcaaatt acctgagtat ttttctaatc 
gcaacaagat tcaaagtatt tattgcacag 
tcaatctctc tttagacctg tccctgaacc 

40aagaaattag gcttcataag ctgactttaa 



22 



cttctttccc 


ctatagaaat 


atatatatan 


60 


ataatttaaa 


tgacttatat 


acatnatcat 


120 


ttcncatctc 


cagaatccaa 


cttaaatgtt 


180 


gtgattatca 


ccaatgtaat 


gatatagagn 


240 


gtatagagca 


atgcagagat 


agaatgatgg 


300 


acttcaaaaa 


taatcaagtt 


cagctgtttg 


360 


aggaaaagag 


gtactcaaga 


tcacacagta 


420 


ctcttttgtc 


cactcttcca 


gcgttctttc 


480 


caataaataa 


caaaagcttg 


gaccacaatg 


540 


ctaaataaaa 


tgggtgggga 


atattgtaaa 


600 


atttgacatt 


gtaactgaca 


agcacctgca 


660 


gtttgctggg 


ctgcatcaaa 


gccctgtctc 


720 


atggctttca 


tttcttttag 


gataaaattc 


780 


cctaattggg 


aaccttcttt 


ctccacaact 


840 


tctccaacca 


tggaagctat 


ttgtcatgat 


900 


cttggggctt ' 


'ttgtgtttgc 


tgttcacttc 


960 


tttcaatttt 


caacttaagt 


gttctcagag 


1020 


ggttctctgg 


atgtgaactc 


ttatagcaca 


1080 


cctatcacct 


gttacaagca 


tttacctctg 


1140 


atgtaagcac 


ttttcataaa 


cattaagagt 


1200 


tctctgaaat 


tgatatttac 


cagtcattta 


1260 


tatctgtacc 


aatcagatgt 


ataatcacaa 


1320 


tccaaggcta 


ttacatgctt 


tatcaactgc 


1380 


ttanctgatt 


tccttcttat 


ggattttctc 


1440 


aaaaagagcc 


tatcattgca 


gccagtatga 


1500 


cttattccat 


catcatctgt 


cctgcttgat 


1560 


tcacatctgt 


atgaagagct 


ggatgactag 


1620 


gcagaaatat 


tagataatca 


atgtcttttt 


1680 


aagatggggc 


atatcagagc 


ctaagccacc 


1740 


tccagagttt 


agccctggga 


gccttttctg 


1800 


tggagacaaa 


tctagcatct 


ctagagaact 


1860 


aacttaatgt 


ggctcacaat 


cttatccaat 


1920 


tgaccaatct 


agagcacttg 


gacctttcca 


1980 


acttgcgggt 


tctacatcaa 


atgcccctac 


2040 


ctatgaactt 


tatccaacca 


ggtgcattta 


2100 


gaaataattt 


tgatagttta 


aatgtaatga 


2160 
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aaacttgtat 


tcaaggtctg 


gctggtttag 


aagtccatcg 


tttggttctg 


ggagaattta 


2220 


gaaatgaagg 


aaacttggaa 


aagtttgaca 


aatctgctct 


agagggcctg 


tgcaatttga 


2280 


ccattgaaga 


attccgatta 


gcatacttag 


actactacct 


cgatgatatt 


attgacttat 


2340 


ttaattgttt 


gacaaatgtt 


tcttcatttt 


ccctggtgag 


tgtgactatt 


gaaagggtaa 


2400 


Saagacttttc 


ttataatttc 


ggatggcaac 


atttagaatt 


agttaactgt 


aaatttggac 


2460 


agtttcccac 


attgaaactc 


aaatctctca 


aaaggcttac 


tttcacttcc 


aacaaaggtg 


2520 


ggaatgcttt 


ttcagaagtt 


gatctaccaa 


gccttgagtt 


tctagatctc 


agtagaaatg 


2580 


gcttgagttt 


caaaggttgc 


tgttctcaaa 


gtgattttgg 


gacaaccagc 


ctaaagtatt 


2640 


tagatctgag 


cttcaatggt 


gttattacca 


tgagttcaaa 


cttcttgggc 


ttagaacaac 


2700 


lOtagaacatct ggatttccag 


cattccaatt 


tgaaacaaat 


gagtgagttt 


tcagtattcc 


2760 


tatcactcag 


aaacctcatt 


taccttgaca 


tttctcatac 


tcacaccaga 


gttgctttca 


2820 


atggcatctt 


caatggcttg 


tccagtctcg 


aagtcttgaa 


aatggctggc 


aattctttcc 


2880 


aggaaaactt 


ccttccagat 


atcttcacag 


agctgagaaa 


cttgaccttc 


ctggacctct 


2940 


ctcagtgtca 


actggagcag 


ttgtctccaa 


cagcatttaa 


ctcactctcc 


agtcttcagg 


3000 


IStactaaatat 


gagccacaac 


aacttctttt 


cattggatac 


gtttccttat 


aagtgtctga 


3060 


actccctcca 


ggttcttgat 


tacagtctca 


atcacataat 


gacttccaaa 


aaacaggaac 


3120 


tacagcattt 


tccaagtagt 


ctagctttct 


taaatcttac 


tcagaatgac 


tttgcttgta 


3180 


cttgtgaaca 


ccagagtttc 


ctgcaatgga 


tcaaggacca 


gaggcagctc 


ttggtggaag 


3240 


ttgaacgaat 


ggaatgtgca 


acaccttcag 


ataagcaggg 


catgcctgtg 


ctgagtttga 


3300 


20atatcacctg 


tcagatgaat 


aagaccatca 


ttggtgtgtc 


ggtcctcagt 


gtgcttgtag 


3360 


tatctgttgt 


agcagttctg 


gtctataagt 


tctattttca 


cctgatgctt 


cttgctggct 


3420 


gcataaagta 


tggtagaggt 


gaaaacatct 


atgatgcctt 


tgttatctac 


tcaagccagg 


34 80 


atgaggactg 


ggtaaggaat 


gagctagtaa 


agaatttaga 


agaaggggtg 


cctccatttc 


3540 


agctctgcct 


tcactacaga 


gactttattc 


ccggtgtggc 


cattgctgcc 


aacatcatcc 


3600 


25atgaaggttt 


ccataaaagc 


cgaaaggtga 


ttgttgtggt 


gtcccagcac 


ttcatccaga 


3660 


gccgctggtg 


tatctttgaa 


tatgagattg 


ctcagacctg 


gcagtttctg 


agcagtcgtg 


3720 


ctggtatcat 


cttcattgtc 


ctgcagaagg 


tggagaagac 


cctgctcagg 


cagcaggtgg 


3780 


agctgtaccg 


ccttctcagc 


aggaacactt 


acctggagtg 


ggaggacagt 


gtcctggggc 


3840 


ggcacatctt 


ctggagacga 


ctcagaaaag 


ccctgctgga 


tggtaaatca 


tggaatccag 


3900 


BOaaggaacagt 


gggtacagga 


tgcaattggc 


aggaagcaac 


atctatctga 


agaggaaaaa 


3960 


taaaaacctc 


ctgaggcatt 


tcttgcccag 


ctgggtccaa 


cacttgttca 


gttaataagt 


4020 


attaaatgct 


gccacatgtc 


aggccttatg 


ctaagggtga 


gtaattccat 


ggtgcactag 


4080 


atatgcaggg 


ctgctaatct 


caaggagctt 


ccagtgcaga 


gggaataaat 


gctagactaa 


4140 


aatacagagt 


cttccaggtg 


ggcatttcaa 


ccaactcagt 


caaggaaccc 


atgacaaaga 


4200 


35aagtcatttc 


aactcttacc 


tcatcaagtt 


gaataaagac 


agagaaaaca 


gaaagagaca 


4260 


ttgttctttt 


cctgagtctt 


ttgaatggaa 


attgtattat 


gttatagcca 


tcataaaacc 


4320 


attttggtag 


ttttgactga 


actgggtgtt 


cactttttcc 


tttttgattg 


aatacaattt 


4380 


aaattctact 


tgatgactgc 


agtcgtcaag 


gggctcctga 


tgcaagatgc 


cccttccatt 


4440 


ttaagtctgt 


ctccttacag 


aggttaaagt 


ctagtggcta 


attcctaagg 


aaacctgatt 


4500 


40aacacatgct 


cacaaccatc 


ctggtcattc 


tcgagcatgt 


tctatttttt 


aactaatcac 
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Amino Acid 
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HUMAN TLR4 GENOMIC SEQUENCE 

AAAATACTCC CTTGCCTCAA AAACTGCTCG GTCAAACGGT 
GATAGCAAAC CACGCATTCA CAGGGCCACT GCTGCTCACA 
AAACCAGTGA GGATGATGCC AGGATGATGT CTGCCTCGCG 
CCTGGCTGGG ACTCTGATCC CAGCCATGGC CTTCCTCTCC 
TGCGTGAGAC CAGAAAGCTG GGAGCCCTGC GTGGAGGTAT 
GTGGCTGGAG TCAGCTCCTC TGAACTTTCC CTCACTTCTG 
CCCAGAACTT CTCACTGTGT GCCCTGGTTT GTTTATTTTT 
GCAAAAAAAA AAAGAGTTAA ATTACCTTAA AGACTCAAGA 
AGCCACAGAG ATCAAATAAT TCATTGTTAC AGGGCACTAG 
AGGCAGCCAT TGGGGGTTTG TTCCATTTGG AAATTTTGAG 
TGCTAACAGG GGCATGAGAT AACATAGATC TGCTTAAGGT 
CCCTGCTCTG CTACCTTGTG GCTCTGTGAA GAAATTATCA 
AACCTGTCTG AGACTAGTTT TCGCATCTGT AAGAGAATTA 
TAATACCTTC TTCACTAGAG AGTAAGCAGA CTGCTTCAGT 
GTCATTTCTT CCCACTGGTG GTCTTTACAC TCAGCTTCAA 
GCAGTCACCC TGCTCCTTTC AATCTCAGGA AAAAGATGGC 
TTTGTGTGTG TGTCTCT:A: G:AGAAAGAA CTTTCTAAGT 
TGGTGCAGA CTTCTGTATG CAGTAATATA GTTTAGTCCA 
GAGGATGAAA AAAATAAGAG ArATGAAAAA GGAAAAGAGA 
GAGAGAGAzG AAGAAAAAAG C/WVGAGGGAA ATiATGTATA 
ATGTCAGCTA ATGCAAC:AG TTTCTTTCTT AGTGAAATAC 
CAATCAGCTG :GTTG:GTAA TCTT:ATTCA TGATGGATCf 
CTTTTGTTTT TCCCCTGCGC AGACTTCrAC AGTTGCTTTA 
GAAACCCATA GTAGAGCCGA A:CAGCTAAG AAAATGATTT 
ACAGTGAGGC AGGGTCAGAA ACTCAAGAGA GAAAAAGCCA 
GCTGCAGTC: CTGAAGTiTG AGGATATAGG :AGAAAATCA 
AGTAATATTT AGCAAAGACT AATTCATTAT CTTGAAGCCA 
TCCCTTCCCT CAATTCCCTG CCCATAGTCC TCCTCCTTGT 
CCTCTTCTCT GNA:TCCCTC TGCTGTTAGG TTA:ATGG:A 
GATAGATTTT CTAATTANGC TCACTGCGAG ATAAAACCCA 
GCCCATGTTT CTATTAGNCA ATATTGTCTT TGAGGCTCCA 
TGGCTTGCAN CATTTAAGCA GACATACGAA TGAAGATCTG 
CATGTTTGAA CTCTGACTTT GCGCATATTA CTTCATTTCT 
TTGAATTTCC ATTTTCCTCA TCTTTAAATG CTTATTTGAA 
GATTAAGTGA AAGTATATAA CAAACAAGAA CTATGCAGGC 
GTATGGTAAG GGATTAATGA TAGATGATAA TAATTAATGT 
TGACATCTAT TGATCACTTA TACTGTAGCG GGCTTTTAAA 
TAAACTCTTT AAACACCTTA TCTCATTTAA TCCTTCAAAC 
ATTCTATTGG TTTCAAACAA CAGAAAACTA CAATTAGCTG 
GCTTCTGCAA GGAATTTTGT TGGAGGAAAT GAGAGCATTC 
AGAAATTAGA TGGGAGCGTT AGAGAATTAG GCTTACAAAG 
AATGTGGGAA AGTAGGCTAG AAAGCAGTGT AAAAACAAAG 
ACAGCATAAA GCACTTGACC TTATTTACTA GGTTCCACCA 
TGGGAATCCA TGCACTCTAA AGATTTCCCC CTATTTCTAC 
ATCACTTTGC TCAAGGGTCA ATGAGCC/^G GAAAAGAATG 
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CAGTTGTCAA AATCTGGGCC ATGACTAAGG AAGGTCTGGA 
CATCTTGACT GCCAGACAGT CTCCCCAATG ATATGGAGTA 
TTTAGAATGA TACTGGATAT TTTATTTATT TTTTGTATTT 
TCAACTTTTA AGTTCAGAGG CACATGTGCA GAGCATGCAG 
GTTTATTACA TAAGTAAATG TGTGCCATGG TGATTTGCTG 
CATAGATCAT GAAAATATGG AACGCATCAT GGATTTGTGT 
GTCATCCTTG TGCAGGGGCC ATGCTCATCT TCTCTGTATC 
CTTCCAATTT TAGTATATGT GCTACTGCAG CAAGCACGAT 
ATTGGATATT TTATTACCTA CATTTTACAT ATGATAAAAT 
GAGGCTCACT GAGGTTTTTC TTTTGTTCGT TTTATTTTGT 
TTTGTTTTTA AAGACTTGGC CCTAAACCAC ACAGAAGAGC 
TGGCATGAAA CCCAGAGCTT TCAGACTCCG GAGCCTCAGC 
CCTTCACCCC GATTCCATTG CTTCTTGCTA AATGCTGCCG 
TTTTATCNCG GAGGTTAGAA TGCTGAGCAC GTAGTAGGTG 
CTCTTTACTT TCTAATCTAG AGTAAGACAA TTTATAAGCA 
TGAATTGAGT GAATGGATGG ATGGATATAT GGATGGAAGG 
ATGGACAGAT GGATGAAAGG TTGACTGAAT TTTGTGCTTG 
CACAAAAAGA GGCCCCTCTC CACCATCTCT GGTCTAGGAG 
AGGGGAGTTG GGAGACCATG CAGTAAAGAT ACTTCAyGTC 
ATGTGTAATC ATTGCAGGTG GTTCCTAATA TTACTTATCA 
ATGCATGGAG CTGAATTTCT ACAATU^TCCC CGACAACCTC 
CCCTTCTCAA CCAAGAACCT GGACCTGAGC TTTAATCCCC 
TGAGGCATTT AGGCAGCTAT AGCTTCTTCA GTTTCCCAGA 
ACTGCAGGTG CTGGATTTAT CCAGGTAATG AATCCACTTT 
TACATACTGC ACAAGGTGAG GTGTTCATTG TCCTATCATT 
TCATTATTGG ACTGGAAAGC TTGGTTTGTG GAGTCTCATC 
TTCATTCACT TATTCATTCA TACAACAGAT GTCTTATTAA 
CTATATAACC TTGAGCAAGC TACCTCTATT CTCCAGGTCT 
CAGTTTTCTA ATCTGTGAAG TAGGCAGTTG GCTGAGACAG 
CTTCTAAGGG CAATTCTAAT TTTAGGTTTT CTTTTAAGAC 
AGGAGAGAAA ATTAGCTTAA ATTCTTTCAT AAGCAGCTAT 
TTATTGACTA CTTGCTATAT GTTGTACACT CTGCAAGAAG 
ACAGGCATAT ATTGATATAT AACACACAGC CCCTGTTGTT 
AAGGAGGCAT ATCTTCTTGA AAGAGTTAAT ACCTTAAAGT 
CCTGGGTATG GTCCTGGGTA CATAGTATAT AGTCAACACA 
TTTTAATTAT GATTTTTTGG ATCTGGAAAC TGATATAAAG 
ATAGCGACAT ATAACAGTAG GTGATAAATT ATGTTTAAAC 
TAAAGGTAAC TAATTGTATT TTTCAGAAGA GGGGCCTTCT 
CTGTGGTGGG TAGTCAAGAA AGATTCATGA ACTGCATAAG 
ATTCAAACAA TGTCTAGAAT ATTAAAACTA GTGGTGGCAG 
GTGAAATGTC ATCTTGATAT TTTAGGGGAA CCAAATTCTA 
AAAGGGTTTT CATCATCGGG GCCTTATTTG CAAATCGAAC 
TAGATAATGG ATCATGTTCT CTGCAATGGT TTGTAAAACA 
TTTCAAAACA TTTTACATAT TTTTTATTAT AGAAATTATT 
GATAAAGACT AAGGTCACAG TATAAAAATC CTTTTTAGAG 
CAGACATTTC TGTAGAAGAG TGAACATATG ACCTATTATA 
CTCTAATTTG GATATAGATA GGATGTAACA AAGGAGTAAT 
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GGGAACAATT CAAAGGCAGT GGTATAGTGC ATANAGTCCT 
GTTGGGGTCA GAAGACCTGA GCCCAAGTTT ACCCCCAACA 
TTTATAACCC ATGTAACCTT AGCATATTAC TTCATCTCCC 
TTAATCCTTA GTTTCATATC TGATCAATGG AAATGATGAA 
ACTTATTCTG CTGGATTAAA TGTGATAATA AATATTAATA 
TGCTGTATAT ATTTAAATTT TTATAAAATA TATTTTATAA 
GCATAAAGTA TTCTTACAGA ATTTCATTAG GTTTTTAAAA 
TAATTTCAAC TTTTATTTTT GATTCAGGGA TTTACATGGT 
TATATTGCGT AATGCTGAGG TGTAGGGTAC AATCGATACC 
ATCACTCAGG TAGTGAGCAT AGTACCCAAT AGTTAGTTTT 
TCAACCCTTG CTGCTTTCTC TCTATCCCCT CTCTAGTAAT 
CCCCAGGGTC TATTTTTGTC ATCTTTATGT CCATGTGTAC 
TCCATGTTTG GATCCTACTT ATAAAGTGAG AACTCATGGT 
ATTTGGCTTT CTGTNCCTTT GTTNGCTAAT TTGCTTAGGA 
TAATGGCTAC TAGCTGCATC TATGCCATTA TGTTCTAAAT 
TTCANTTNCC TGCATGAAAA TTTTGTCAAG TACTCTATTA 
AGGTAGACCA CCTCTCCCTT TTTTTTTCAA ACAAGAAGTA 
GNTTTTCCCA AACAATGCCC TTATGGAATT NATCTTCAAT 
CCNNGGATAC CCAATAACTT GCCCCAAANC CTTAATCTGN 
CTTACAGAGA GGCCACCTTC CTTCTGTAAC CCATAGGAGA 
TTTGGATTGG TAAGAATGCT TTGTGATAGC CCAGCAGCCT 
TCTTTCCCCT ATAGAAATAT ATATATANTC TTTTTATAGG 
TGAGGPiACTG AAGCTTGAAT AATTTAAATG ACTTATATAC 
ATNATCATTG CTTGTTAGCC ACAGACCAGA GATTTAAGTT 
CNCATCTCCA GAATCCAACT TAAATGTTTT CTTTGTCTTA 
ATACTCTACT TCTCTAAAGT GATTATCACC AATGTAATGA 
TATAGAGNCA CAGCAAGACC CTTTCCTTCT CACCTAATGT 
ATAGAGCAAT GCAGAGATAG AATGATGGGC TATAACAATC 
ATATAATTGA AAGAAAGAAC TTCAAAAATA ATCAAGTTCA 
GCTGTTTGAT TTATAAATGT GATAACTAAA ACCTAGAGAG 
GAAAAGAGGT ACTCAAGATC ACACAGTAGG AGAGGACTGC 
AGAAACACCA AACCCAAGCT CTTTTGTCCA CTCTTCCAGC 
GTTCTTTCTA CTATACTGCC TATCCTTTAT CTAGTTACCA 
ATAAATAACA AAAGCTTGGA CCACAATGCT TTTATTGTCT 
AGGAAACTCC TG7UVGAAGCT AAATAAAATG GGTGGGGAAT 
ATTGTAAATG TAATTCAGGC TGGATTAAGA AAGAACTTAT 
TTGACATTGT AACTGACAAG CACCTGCAAT GCTGAAAGGA 
ATTTTTCATT GGCNTGCTGT TTGCTGGGCT GCATCAAAGC 
CCTGTCTCTA GGACATGTCT CTGAACATTG TGTGTAGCAT 
GGCTTTCATT TCTTTTAGGA TAAAATTCAA AACCCTTTAT 
CTGGTTGGTA AACCTCTGCC TAATTGGGAA CCTTCTTTCT 
CCACAACTCC ATATTGTACA CTCCAATTTC ATCTCTGTTC 
TCCAACCATG GAAGCTATTT GTCATGATTC CTCCTTGTGT 
CATTTTTTTT CTGTCAACCT TGGGGCTTTT GTGTTTGCTG 
TTCACTTCAC CTCCTTTTAT TGTTAACTTC TACTCATCTT 
TCAATTTTCA ACTTAAGTGT TCTCAGAGAA ACCTACTTTG 
ATTTTCTTGG TCCANAACGG TTCTCTGGAT GTGAACTCTT 
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ATAGCACATA ATTTTCACTT TTTTCCACAA AACTCGCTCC 
TATCACCTGT TACAAGCATT TACCTCTGAT AACAAGAACT 
TTCAAATATC TAGCTGTCAT GTAAGCACTT TTCATAAACA 
TTAAGAGTAT CTGTGACACT TATGTGTAAT GTTTCGTATC 
TCTGAAATTG ATATTTACCA GTCATTTATC TTGGCTACCA 
ACTAACAACT ATCCATATTA TCTGTACCAA TCAGATGTAT 
AATCACAATT TTGTGTGACA GAAAATGGCT AAACTTGATC 
CAAGGCTATT ACATGCTTT: ATCAACTGCA CAATCTTTAT 
ATATGTCAAT TATTGATCTT TAACTGATTT CCTTCTTATG 
:GATTTTCTC CTCTGCTTAT CATGTATGCC TAACATrGAC 
AAAAAAG : AG . CCTA : TCATT GCAGCCAGTA TGATAATACT 
CA:GTCTGTG GGGCTTCTTA TTTGCTTAT : TCCATCATCA 
TCTGTCCTGC TTGATGTCTT TGCCTATGCA CAATCATATG 
:ACCCATCAC ATCTGTATGA AGAGC : TGGA TGACTAGGAT 
TAATATTCT : AT : : : TTTAG GTTCTTATT : CAGCAGAAAT 
ATTAGATAA: TCAATGTCTT TTTATTCCTG TAGGTGTGAA 
ATCCAGACAA' TTGAAGATGG GGCATATCAG AGCCTrAAGC 
CACCTCTCTA CCTTAATATT GACAGGAAAC CCCATCCAGA 
GTTTAGCCCT GGGAGCCTTT TCTGGACTAT CAAGTTTACA 
GAAGCTGGTG GCTGTGGAGA CAAATCTAGC ATCTCTAGAG 
AACTTCCCCA TTGGACATCT CAAAACTTTG AAAGAACTTA 
ATGTGGCTCA CAATCTTATC CAATCTTTCA AATTACCTGA 
GTATTTTTCT AATCTGACCA ATCTAGAGCA CTTGGACCTT 
TCCAGCAACA AGATTCAAAG TATTTATTGC ACAGACTTGC 
GGGTTCTACA TCAAATGCCC CTACTCAATC TCTCTTTAGA 
CCTGTCCCTG AACCCTATGA ACTTTATCCA ACCAGGTGCA 
TTTAAAGAi\A TTAGGCTTCA TAAGCTGACT TTAAGAAATA 
ATTTTGATAG TTTAAATGTA ATGAAAACTT. GTATTCAAGG 
TCTGGCTGGT TTAGAAGTCC ATCGTTTGGT TCTGGGAGAA 
TTTAGAAATG AAGGAAACTT GGAAAAGTTT GACAAATCTG 
CTCTAGAGGG CCTGTGCAAT TTGACCATTG AAGAATTCCC 
GATTAGCATA CTTAGACTAC TACCTCGATG ATATTATTGA 
CTTATTTAAT TGGTTGACAA ATGGTTCTTC ATTTTCCCTG 
GTGAGTGTGA CTATTGAAAG GGTAAAAGAC TTTTCTTATA 
ATTTCGGATG GCAACATTTA GAATTAGTTA ACTGTAAATT 
TGGACAGTTT CCCACATTGA AACTCAAATC TCTCAAAAGG 
CTTACTTTCA CTTCCAACAA AGGTGGGAAT GCTTTTTCAG 
AAGTTGATCT ACCAAGCCTT GAGTTTCTAG ATCTCAGTAG 
AT^TGGCTTG AGTTTCAAAG GTTGCTGTTC TCAAAGTGAT 
TTTGGGACAA CCA:GCCT:A AAGTATTTAG ATCTGAGCTT 
CAATGGTGTT ArTTACCATG AGTTCAAACT TCTTGGGCTT 
AGAACA:ACT AGAACATCTG GATTTCCAGC ATTCCAATTT 
GAAACA:AAT GAGTGAGTTT TCAGTATTCC TA:TCACTCA 
GAAA:CCT:C ATTTACCTTG ACATTTCTCA TACTCACACC 
AGAGTTGCTT TCAATGGCAT CTTC7y\TGGC TTGTCCAGTC 
TCGAAGTCTT GAAAATGGCT GGCAATTCTT TCCAGGAAAA 
CTTCCTTCCA GATATCTTCA CAGAGCTGAG AAACTTGACC 
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TTCCTGGACC TCTCTCAGTG 
CAACAGCATT TAACTCACTC 
TATGAGCCAC AACAACTTCT 
TATAAGTGTC TGAACTCCCT 
TCAATCACAT AATGACTTCC 
TTTTCCAAGT AGTCTAGCTT 
GACTTTGCTT GTACTTGTGA 
GGATCAAGGA CCAGAGGCAG 
AATGGAATGT GCAACACCTT 
GTGCTGAGTT TGAATATCAC 
TCATTGGTGT GTCGGTCCTC 
TGTAGCAGTT CTGGTCTATA 
CTTCTTGCTG GCTGCATAAA 
TCTATGATGC CTTTGTTATC 
CTGGGTAAGG AATGAGCTAG 
GTGCCTCCAT TTCAGCTCTG 
TTCCCGGTGT GGCCATTGCT 
TTTCCATAAA AGCCGAAAGG 
CACTTCATCC AGAGCCGCTG 
TTGCTCAGAC CTGGCAGTTT 
CATCTTCATT GTCCTGCAGA 
AGGCAGCAGG TGGAGCTGTA 
CTTACCTGGA GTGGGAGGAC 
CTTCTGGAGA CGACTCAGAA 
TCATGGAATC CAGAAGGAAC 
GGCAGGAAGC AACATCTATC 
CTCCTGAGGC ATTTCTTGCC 
TCAGTTAATA AGTATTAAAT 
ATGCTAAGGG TGAGTAATTC 
GGGCTGCTAA TCTCAAGGAG 
AATGCTAGAC TAAAATACAG 
CAACCAACTC AGTCAAGGAA 
TTCAACTCTT ACCTCATCAA 
ACAGAAAGAG ACATTGTTCT 
GAAATTGTAT TATGTTATAG 
TAGTTTTGAC TGAACTGGGT 
TTGAATACAA TTTAAATTCT 
AAGGGGCTCC TGATGCAAGA 
TGTCTCCTTA CAGAGGTTAA 
AGGAAACCTG ATTAACACAT 
TTCTCGAGCA TGTTCTATTT 
ATATTTTTAT TTTTATATAT 
TCTTGCCTAT AAGCTAATAT 
ACGTGCTTCA AATATCCATA 
AAGTATGGAA AAGTACACTC 
TCATTCCAAA GTTATTGCCT 
AAAGCAGCAT TGAAATAATT 



TCAACTGGAG CAGTTGTCTC 
TCCAGTCTTC AGGTACTAAA 
TTTCATTGGA TACGTTTCCT 
CCAGGTTCTT GATTACAGTC 
AAAAAACAGG AACTACAGCA 
TCTTAAATCT TACTCAGAAT 
ACACCAGAGT TTCCTGCAAT 
CTCTTGGTGG AAGTTGAACG 
CAGATAAGCA GGGCATGCCT 
CTGTCAGATG AATAAGACCA 
AGTGTGCTTG TAGTATCTGT 
AGTTCTATTT TCACCTGATG 
GTATGGTAGA GGTGAAAACA 
TACTCAAGCC AGGATGAGGA 
TAAAGAATTT AGAAGAAGGG 
CCTTCACTAC AGAGACTTTA 
GCCAACATCA TCCATGAAGG 
TGATTGTTGT GGTGTCCCAG 
GTGTATCTTT GAATATGAGA 
CTGAGCAGTC GTGCTGGTAT 
AGGTGGAGAA GACCCTGCTC 
CCGCCTTCTC AGCAGGAACA 
AGTGTCCTGG GGCGGCACAT 
AAGCCCTGCT GGATGGTAAA 
AGTGGGTACA GGATGCAATT 
TGAAGAGGAA A7ATAAAAAC 
CAGCTGGGTC CAACACTTGT 
GCTGCCACAT GTCAGGCCTT 
CATGGTGCAC TAGATATGCA 
CTTCCAGTGC AGAGGGAATA 
AGTCTTCCAG GTGGGCATTT 
CCCATGACAA AGAAAGTCAT 
GTTGAATAAA GACAGAGAAA 
TTTCCTGAGT CTTTTGAATG 
CCATCATAT^ ACCATTTTGG 
GTTCACTTTT TCCTTTTTGA 
ACTTGATGAC TGCAGTCGTC 
TGCCCCTTCC ATTTTAAGTC 
AGTCTAGTGG CTAATTCCTA 
GCTCACAACC ATCCTGGTCA 
TTTAACTAAT CACCCCTGAT 
CCAGTTTTCA TTTTTTTACG 
CATAAATAAG GTTGTTTAAG 
TTAACCACTA TTTTTCAAGG 
TGTCACTTTG TCACTCGATG 
ACTAAGTAAT GACTGTCATG 
TGTTTAAAGG GGGCACTCTT 
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TTAAACGGGA AGAAAATTTC 
GACAATTTGG GCTATAGGCA 
CAGGAAGTCA CCTTTTCTTG 
GATAAACCCG GGGTGACCTC 
TGTTTATTTT TTTCAGAACA 
TGAATCTATT TAGGGAGACA 
CCTGTACCCT TCTCACTGCC 
TATTCAAGGC AGGGAGTATA 
CAATGCTCCT TGACCACATT 
CATTGAGAAA ACAATGTGTC 
TAAAGAAGGT TCCCAGAAAA 
TCAGGAAACA GGAACATTCA 
GTCATCAGGG AAATGAAA-AT 
ACCTTATACC AGGTAGATGG 
GTCATCAAGG ATATAGAGAA 
CTGGAGGGAA TGGAAAATGG 
GTACGGAGGT TTCTCAAAAA 
TGATCCAGCA ATCTCACTTC 
TTGAAATCAG AATTTCAAGA 
TCATTGTGGC ACTCTTCACA 
GGAAACAACC CT^AATTTCCA 
GGAAATGTGC ATATAACGTA 
CTAAAAAAAG GGGGGATCCT 
TAAACCCGGA GGCCATTATG 
ACAGAAAGAC AAATACTGCC 
TCTAAAATAG TCAAACTCAT 
TGGTTCCTAG GGAAAAGGAG 
AGGGAGTTGT CTAATTGGTA 
TGAATTAGCT CTAAAGATCA 
TAATGAACAA TACTGTATTA 
GAGGGTACCT CTCATGTTAA 
TACACAAGGA AGCTTTTGGA 
CCTTGATTGT GGTGATGGTT 
CTAAACTCAT CAAATTGTAT 
TATAATATCA AAAAAAAAAA 



CGCTTCCTGG TCTTATCATG 
TGAAGGAAGT GGGATTACCT 
ATTCCAGAAA CATATGGGCT 
ATGAAATGAG TTGCAGCAGA 
AGTGATGTTT GATGGACCTA 
CAGATGGCTG GGATCCCTCC 
AGGAGAACTA CGTGTGAAGG 
CATTGCTGTT TCCTGTTGGG 
TTGGGAAGAG TGGATGTTAT 
TGGAATTAAT GGGGTTCTTA 
GAATGTTCAT TCCAGCTTCT 
AGGAAAAGGA CAATCAGGAT 
AAAAACCACA ATGAGATATC 
CTACTATAAA AAAATGAAGT 
ATTGGAACCC TTCTTCACTG 
TGTAGCCGTT ATGAAAAACA 
TTAAAAATAG AACTGCTATA 
TGTATATATA CCCAAAATAA 
AAATATTTAC ACTCCCATGT 
ATCACTGTTT CCAAAGTTAT 
TTGGAAAATA AATGGACAAA 
CAATGGGGAT ATTATTCAGC 
GTTATTTATG ACAACATGAA 
CTATGTAAAA TGAGCAAGTA 
TGATTTCATT TATATGAGGT 
AGAAGCAGAG AATAGAACAG 
GAAGGGAGAA ATGAGGAAAT 
TAAAATTATA GTATGCAAGA 
GCTGTATAGC AGAGTTCGTA 
TGCACTTAAC ATTTTGTTAA 
GTGTTCTTAC CATATACATA 
GGTGATGGAT ATATTTATTA 
TGACAGGTAT GTGACTATGT 
ACATTAAATA TATGCAGTTT 
AAAAAAAA 
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